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Methods for Obtaining Plant Varieties 

TECHNICAL FIELD 

The present invention relates to nucleotide sequences which encode polypeptides 
involved in the DNA mismatch repair systems of plants, and to the polypeptides encoded 
5 by those nucleotide sequences. The invention also relates to nucleotide sequences and 
polypeptide sequences for use in altering the DNA mismatch repair system in plants. The 
invention also relates to a process for altering the DNA mismatch repair system of a plant 
cell, to a process for increasing genetic variations in plants and to processes for obtaining 
plants having a desired characteristic. 

1 o BACKGROUND OF THE INVENTION 

Plant breeding essentially relies on and makes use of genetic variation which occurs 
naturally within and between members of a family, a genus, a species or a subspecies. 
Another source of genetic variation is the introduction of genes from other organisms 
which may or may not be related to the host plant. m 

is Allelic loci or non-allelic genes which constitute or contribute to desired quantitative 
(e.g. growth performance, yield, etc.) or qualitative (e.g. deposition, content and 
composition of seed storage products; pathogen resistance genes: etc.) traits that are 
absent, incomplete or inefficient in a species or subspecies of interest are typically 
introduced by the plant breeder from other species or subspecies, or de novo. This 

20 introduction is often done by crossing, provided that the species to be crossed are sexually 
compatible. Other means of introducing genomes, individual chromosomes or genes into 
plant cells or plants are well known in the art. They include cell fusion, chemically aided 
transfection (Schocher et al., 1986, Biotechnology 4: 1093) and ballistic (McCabe et al., 
1988, Biotechnology 6: 923), microinjection (Neuhaus et al., 1987, TAG 75: 30), 

25 electroporation of protoplasts (Chupeau et al., 1989, Biotechnology 7: 53) or microbial 
transformation methods such as Agrobacterium mediated transformation (Horsch et al., 
1985, Science 227: 1229; Hiei et al., 1996, Biotechnology 14: 745). 

However, when a foreign genome, chromosome or gene is introduced into a plant, it 
will often segregate in subsequent generations from the genome of the recipient plant or 
30 plant cell during mitotic and meiotic cell divisions and, in consequence, become lost from 
the host plant or plant ceil into which it had been introduced. Occasionally, however, the 
introduced genome, chromosome or gene physically combines entirely or in part with the 
genome, chromosome or gene of the host plant or plant cell in a process which is called 
recombination. 

35 Recombination involves the exchange of covalent linkages between DNA molecules 
in regions of identical or similar sequence. It is referred to here as homologous 
recombination if donor and recipient DNA are identical or nearly identical (at least 99% 
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base sequence identity), and as homeologous recombination if donor and recipient DNA 
are not identical but are similar (less than 99% base sequence identity). 

The ability of two genomes, chromosomes or genes to recombine is known to 
depend largely on the evolutionary relation between them and thus on the degree of 
s sequence similarity between the two DNA molecules. Whereas homologous 
recombination is frequently observed during mitosis and meiosis, homeologous 
recombination is rarely or never seen. 

From a breeder's perspective, the limits within which homologous recombination 
occurs, therefore, define a genetic barrier between species, varieties or lines, in contrast 

io to homeologous recombination which can break this barrier. Homeologous recombination 
is thus of great importance for plant breeding. Accordingly there is a need for a process 
for enhancing the frequency of homeologous recombination in plants. In particular, there 
is a need for a process of increasing homeologous recombination to significantly shorten 
the length of breeding programs by reducing the number of crosses required to obtain an 

is otherwise rare recombination event. 

At least in Escherichia coli, homologous and homeologous recombination are known 
to share a common pathway that requires among others the proteins RecA, RecB, RecC. 
RecD and makes use of the SOS induced RuvA and RuvB, respectively. It has been 
suggested that mating induced recombination follows the Double-Strand Break Repair 

20 model (Szostak et al., 1983, Cell 33, 25-35), which is widely used to describe genetic 
recombination in eukaryotes. Following the alignment of homologous or homeologous 
DNA double helices the RecA protein mediates an exchange of a single DNA strand from 
the donor helix to the aligned recipient DNA helix. The incoming strand screens the 
recipient helix for sequence complementarity, seeking to form a heteroduplex by hydrogen 

25 bonding the complementary strand. The displaced homologous or homeologous strand of 
the recipient helix is guided into the donor helix where it base pairs with its counterpart 
strand to form a second heteroduplex. The resulting branch point then migrates along the 
aligned chromosomes thereby elongating and thus stabilising the initial heteroduplexes. 
Single stranded gaps (if present) are closed by DNA synthesis. The strand cross overs 

30 (Holliday junction) are eventually resolved enzymatically to yield the recombination 
products. 

Although in wild type E. coli homologous and homeologous recombination are thus 
mechanistically similar if not identical, homologous recombination in conjugational 
crosses E. coli x E. coli occurs five orders of magnitude more frequently than 
35 homeologous recombination in conjugational crosses £. coli x S. typhimurium (Matic et 
al. 1995; Cell 80, 507-^15). The imbalance in favour of homologous recombination was 
shown to be caused largely by the bacterial MisMatch Repair (MMR) system since its 
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inaciivation increased the frequency of homeologous recombination in E. coli up to 1000 
fold (Rayssiguier et al. 1989. Nature 342. 396-401). 

In E. coli. the MMR system (reviewed by Modrich 1991, Annual Rev Genetics 25. 
229-253) is composed of only three proteins known as MutS, MutL and MutH. MutS 
s recognizes and binds to base pair mismatches. MutL then forms a stable complex with 
mismatch bound MutS. This protein complex now activates the MutH intrinsic single 
stranded endonuclease which nicks the strand containing the misplaced base and thereby 
prepares the template for DNA repair enzymes. 

During recombination, MMR components inhibit homeologous recombination. In 
io vitro experiments demonstrated that MutS in complex with MutL binds to mismatches at 
the recombination branch point and physically blocks RecA mediated strand exchange and 
heteroduplex formation (Worth et al., 1994; PNAS 91. 3238-3241). Interestingly, the 
SOS dependent RuvAB mediated branch migration is insensitive to MutS/MutL, 
explaining the observed slight increase in SOS dependent homeologous recombination. 
1 5 Homeologous mating even induces the SOS response, thereby taking advantage of RuvA£ 
induction (Matic et al. 1995. Cell 80. 507-515). 

The MMR system thus appears to be a genetic guardian over genome stability : tn E. 
coli. In this role it essentially determines the extent to which genetic isolation, that is, 
speciation. occurs. The diminished sensitivity of the SOS system to MMR, however, 
20 allows (within limits) for rapid genomic changes at times of stress, providing the means 
for fast adaptation to altered environmental conditions and thus contributing to intraspecies 
genetic variation and species evolution. 

The important role of MMR in preserving genomic integrity has been established 
also in certain eukaryotes. In its efficiency, the human MMR, for example, may even 

25 counteract potential gene therapy tools such as triple-helix forming oligonucleotides 
including RNA-DNA hybrid molecules (Havre et al., 1993, J. Virology 67: 7234-7331; 
Wang et al., 1995, Mol. Cell. Biol. 15: 1759-1768; Kotani et al., 1996, Mol. Gen. 
Genetics 250: 626-634; Cole-Strauss et al., 1996, Science 273: 1387-1389). Such 
oligonucleotides are designed to introduce single base changes into selected DNA target 

30 sequences in order to inactivate for example cancer genes or to restore their normal 
function. The resulting base mismatches however are recognised by the mismatch repair 
system which then directs removal of the mismatched base, thereby reducing the 
efficiency of oligonucleotide induced site-specific mutagenesis. 

To date, two families of related genes, homologous to the bacterial MutS and MutL 
35 genes have been identified or isolated in yeast and mammals (recent reviews by Arnheim 
and Shibata, 1997, Curr. Opinion Genet. Dev. 7, 364-370; Modrich and Lahue, 1996, 
Annual Rev. Biochem. 65, 101-133; Umar and Kunkel, 1996, Eur. J. Biochem. 238, 297- 
307). Biochemical and genetic analysis indicated that eukaryotic MutS homologs (MSH) 
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and MutL homologs (MLH. PMS), respectively, fulfil similar protein functions as their 
bacterial counterparts. Their relative abundance, however, could reflect different 
mismatch specificity and/or specialisation for different tissues or organelles or 
developmental processes such as mitotic versus meiotic recombination. 

5 To date, six different genes homologous to MutS have been isolated in yeast 
(yA/5//). and their homologs have been found in mouse (mMSH) and human (hMSH), 
respectively. Encoded proteins yMSH2, yMSH3 and yMSH6 appear to be the main MutS 
homologs involved in MMR during mitosis and meiosis in yeast, where the 
complementary proteins MSH3 and MSH6 alternatively associate with MSH2 to recognise 
10 different mismatch substrates (Masischky et ah, 1996, Genes Dev. 10, 407-420). Similar 
protein interactions have been demonstrated for the human homologs hMSH2, hMSH3 
and hMSH6 (Acharya et a!., 1996, PNAS 93, 13629-13634). 

MutL homologs (MLH and PMS), recently reviewed by Modrich and Lahue (1996, 
Annual Rev. Biochem. 65. 101-133) have so far been found in yeast (yMLHl and 

is yPMSl), mouse (mPMS2) and human (hMLHl. hPMSl and hPMS2). The hPMSZ is 5 
member of a family of at least 7 genes (Horii et a!.. 1994, Biochem. Biophys. Res. 
Commun. 204, 1257-1264) and its gene product is most closely related to yPMSl. Prolla 
et al. (1994, Science 265, 1091-1093) presented evidence for yPMSl and yMLHl to 
physically associate with each other and, together, to interact with the MutS homolog 

20 yMSH2 to form a ternary complex involved in mismatch substrate binding. 

However, while medical interest in mismatch repair has prompted extensive research 
on MMR in bacteria, yeast and mammals, MMR genes have not been isolated from higher 
plants prior to the present invention and no attempts to adjust the plant MMR to plant 
breeding needs have been reported. 

25 SUMMARY OF THE INVENTION 

According to a first embodiment of the invention, there is provided an isolated and 
purified DNA molecule comprising a polynucleotide sequence encoding a polypeptide 
functionally involved in the DNA mismatch repair system of a plant. In one form of this 
embodiment, the invention provides an isolated and purified DNA molecule comprising a 

30 polynucleotide sequence encoding a polypeptide which is homologous to a mismatch 
repair polypeptide of a yeast or of a human. More particularly, the invention provides 
polynucleotide sequences encoding polypeptides which are homologous to the mismatch 
repair polypeptides MSH3 and MSH6 of Saccharomyces cerevisiae. Still more 
particularly, the invention provides the coding sequences of the genes AtMSH3 and 

35 AtMSH6 of Arabidopsis thaliana, as defined hereinbelow, and polynucleotide sequences 
encoding polypeptides which are homologous to polypeptides encoded by AtMSH3 and 
AtMSH6. 
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According to a second embodiment of the invention, there is provided an isolated 
and purified polypeptide functionally involved in the DNA mismatch repair system of a 
plant, for example a polypeptide which is homologous to a mismatch repair polypeptide of 
a yeast or of a human such as a polypeptide encoded by the genes AtMSH3 or A1MSH6 of 
5 Arabidopsis thaliana, as defined hereinbelow. 

According to a third embodiment of the invention, there is provided an isolated and 
purified DNA molecule comprising a polynucleotide sequence selected from the group 
consisting of (i) a sequence encoding a polynucleotide which is capable of interfering with 
the expression of a plant polynucleotide sequence encoding a polypeptide which is 
10 homologous to a mismatch repair polypeptide of a yeast or of a human and thereby 
disabling said plant polynucleotide sequence; and (ii) a sequence encoding a polypeptide 
capable of disrupting the DNA mismatch repair system of a plant. 

According to a fourth embodiment of the invention there is provided a chimeric gene 
comprising a DNA sequence selected from the group consisting of (i) a sequence encoding 

is a polynucleotide which is capable of interfering with the expression of a plapt 
polynucleotide sequence encoding a polypeptide which is homologous to a mismatch 
repair polypeptide of a yeast or of a human and thereby disabling said plant polynucleotide 
sequence, and (ii) a sequence encoding a polypeptide capable of disrupting the DNA 
mismatch repair system of a plant: together with at least one regulation element capable of 

20 functioning in a plant cell. Examples of such regulation elements include constitutive, 
inducible, tissue type specific and cell type specific promoters such as 35S. NOS, PRla, 
AoPRl and DMC1. Typically, a chimeric gene of the fourth embodiment will also 
include at least one terminator sequence, more typically exactly one terminator sequence. 

In the third and fourth embodiments, said interference, by said polynucleotide 
25 sequence, with the expression of a plant polynucleotide sequence encoding a polypeptide 
which is homologous to a mismatch repair peptide of a yeast or a human typically occurs 
by hybridisation or by co-suppression. 

According to a fifth embodiment of the invention there is provided a plasmid or 
vector comprising a chimeric gene of the fourth embodiment. A vector of the fifth 
30 embodiment may be, for example, a virai vector or a bacterial vector. 

According to a sixth embodiment of the invention, there is provided a plant cell 
stably transformed, transfected or electroporated with a plasmid or vector of the fifth 
embodiment. 

According to seventh embodiment of the invention, there is provided a plant 
35 comprising a cell of the sixth embodiment. 

According to an eighth embodiment of die. invention, there is provided a process for 
at least partially inactivating a DNA mismatch repair system of a plant cell, comprising 
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transforming or transfecting said plant cell with a DNA sequence of the third embodiment 
or a chimeric gene of the fourth embodiment or a plasmid or vector of the fifth 
embodiment, and causing said DNA sequence to express said polynucleotide or said 
polypeptide. 

5 According to a ninth embodiment of the invention, there is provided a process for 
increasing genetic variation in a plant comprising obtaining a hybrid plant from a first 
plant and a second plant, or cells thereof, said first and second plants being genetically 
different; altering the mismatch repair system in said hybrid plant; permitting said hybrid 
plant to self-fertilise and produce offspring plants; and screening said offspring plants for 

10 plants in which homeologous recombination has occurred. For example, homeologous 
recombination may be evidenced by new genetic linkage of a desired characteristic trait or 
of a gene which contributes to a desired characteristic trait. 

According to a tenth embodiment of the invention there is provided a process for 
obtaining a plant having a desired characteristic, comprising altering the mismatch repair 
is system in a plant, cell or plurality of cells of a plant which does not have said desired 
characteristic, permitting mutations to persist in said cells to produce mutated plant cells, 
deriving plants from said mutated plant cells, and screening said plants for a plant having 
said desired characteristic. 

In a preferred form of the ninth and tenth embodiments of the invention, the step of 
20 altering the mismatch repair system comprises introducing into said hybrid plant, plant, 
cell or cells a chimeric gene of the fourth embodiment and permitting the chimeric gene to 
express a polynucleotide which is capable of interfering with the expression of a plant 
polynucleotide sequence in a mismatch repair gene of the hybrid plant, plant, cell or cells, 
or a polypeptide capable of disrupting the DNA mismatch repair system of the hybrid 
25 plant or cells. 

In other embodiments, the invention provides (a) an oligonucleotide capable of 
hybridising at 45°C under standard PCR conditions to a DNA molecule of the first 
embodiment; (b) an oligonucleotide capable of hybridising at 45°C under standard PCR 
conditions to the DNA of SEQ ID NO: 18 and (c) an oligonucleotide capable of 
30 hybridising at 45°C under standard PCR conditions to the DNA of SEQ ID NO:30; with 
the proviso that the oligonucleotide of (a), (b) and (c) is other than SEQ ID NO:l or SEQ 
ID NO:2. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure I provides a diagrammatic representation of the primer sequences used to 
35 isolate AtMSH3. 

Figure 2 is a plasmid map of clone 52, showing restriction enzyme cleavage sites in 
the 5' half of the full-length cDNA for ArMSH:k 
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Figure 3 is a plasmid map of clone 13. showing restriction enzyme cleavage sites in 
the 3' half of ihe full-length cDNA for AtMSH3. 

Fitzure 4 is a sequence listing of the coding sequence of AtMSH3, together with a 
deduced sequence of the encoded polypeptide. 
5 Figure 5 is a protein alignment of yeast (Saccharomyces cerevisiae) and Arabidopsis 

thaliana MSH3 protein. 

Figure 6 provides a diagrammatic representation of the primer sequences used to 

isolate AIMSH6. 

Figure 7 is a plasmid map of clone 43 , showing restriction enzyme cleavage sites in 
io the 5* half of the full-length cDNA for AtMSH6. 

Figure 8 is a plasmid map of clone 62. showing restriction enzyme cleavage sites in 
the 3* half of the full-length cDNA for AtMSH6. 

Figure 9 is a sequence listing of the coding sequence of AtMSH6, together with a 
deduced sequence of the encoded polypeptide, 
is Figure 10 is a protein alignment of yeast (Saccharomyces cerevisiae) and 

Arabidopsis ihaliana MSH6 protein. 

Fiizure 1 1 is a genomic sequence listing of ArMSH6. 

Figure 12 is a plasmid map of plasmid pPF13. V~ 
Figure 13 is a plasmid map of plasmid pPF14. 
20 Figure 14 is a plasmid map of plasmid pCW186. 

Figure 15 is a plasmid map of plasmid pCW 187. 
Figure 1 6 is a plasmid map of plasmid pPF66. 
Figure 1 7 is a plasmid map of plasmid pPF57. 

Fisure 18 is a diagrammatic representation of an antisense gene construction for use 
25 in homeologous meiotic recombination. 

Figure 19 is a plasmid map of plasmid p3243. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention is based on the inventors' discovery that there exist in higher 
plants genes which are homologous to MMR genes in E. co/i, and to MMR genes in 
30 yeasts and humans. 

Thus, the inventors have identified genes, herein designated AtMSH3 and AtMSH6> 
of the plant Arabidopsis thaliana which encode the proteins AtMSH3 and AtMSH6. 
These plant proteins are homologous to yMSH3 and yMSH6, respectively. The present 
inventors have isolated cDNAs encoding the proteins AtMSH3 and AtMSH6 and have 
35 isolated the complete gene encoding AtMSH6. Given the teaching herein, other genes 
(for example AtMSH2. and genes of other plants) may be obtained which are involved in 
DNA mismatch repair in plants, including other genes which encode polypeptides 
homologous 'to MMR proteins! of yeasts or humans, such , as genes which encode 
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polypeptides homologous to yeast MSH2. MLHl or PMS2, or to human MLHL PMSl or 
PMS2. For example, given the teaching herein, genes of members of the Brassicaceae 
family or of other unrelated families, for example the Poaceae. the Solanaceae. the 
Asteraceae, the Malvaceae, the Fabaceae, the Linaceae. the Canabinaceae. the Dauaceae 
5 and the Cucurbitaceae family, and which encode polypeptides homologous to MMR 
proteins of yeasts or humans may be obtained. 

Examples of plants whose genes encoding polypeptides homologous to MMR 
proteins of yeasts or humans may be obtained given the teaching herein include maize, 
wheat, oats, barley, rice, tomato, potato, tobacco, capsicum, sunflower, lettuce, 
10 artichoke, safflower, cotton, okra, beans of many kinds including soybean, peas, melon, 
squash, cucumber, oilseed rape, broccoli, cauliflower, cabbage, flax, hemp, hops and 
carrot. 

Within the meaning of the present invention, a first polypeptide is defined as 
homologous to a second polypeptide if the amino acid sequence of the first polypeptide 
is exhibits a similarity of at least 50% on the polypeptide level to the amino acid sequence <5f 
the second polypeptide. 

A procedure which may be followed to obtain genes AtMSH3 and AtMSH6 is 
described in Example I. Essentially the same technique may be applied to obtain other 
mismatch repair genes of Arabidopsis thaliana. and essentially the same technique as 

20 exemplified herein may be applied to cDNA obtained by reverse transcription of RNA 
from other plants. Alternatively, given the sequence information disclosed herein, other 
degenerate oligonucleotide primers, especially oligonucleotides of the invention which are 
capable of hybridising at 45°C under standard PCR conditions (such as the conditions 
described in Example 1 using primers UPMU and DOMU) to AtMSH3 and/or AtMSH6 

25 may be designed and obtained for use in isolating sequences of plant mismatch repair 
genes which are homologous to AtMSH3 or AtMSH6, from other plants. Similarly, 
oligonucleotides of the invention which are capable of hybridising at 45°C under standard 
PCR conditions to plant mismatch repair genes of plants other than Arabidopsis thaliana 
also fall within the scope of the present invention and may be utilised to obtain mismatch 

30 repair genes of still other plants. Typically, such oligonucleotides are capable of 
hybridising at 45°C under standard PCR conditions to a DNA molecule which encodes a 
polypeptide which is homologous to a mismatch repair polypeptide of a yeast or a human. 
The temperature at which oligonucleotides of the invention hybridise to AtMSH3 and/or 
AtMSH6, or to plant mismatch repair genes of plants other than Arabidopsis thaliana, or 

35 to DNA molecules which encode polypeptides which are homologous to a mismatch repair 
polypeptide of a yeast or a human may be higher than 45°C for example at least 50°C, or 
at least 55°C, or at least 60°C or as high as 65°C. L 
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The successful gene isolation disclosed herein demonstrates tor the first time the 
existence of MMR in higher plants and indicates the presence of other plant MMR genes. 
For example, genes encoding the plant homologs of MSH1. MSH2. MSH4, MSH5, 
PMSL PMS2 and MLHl may be identified given the teaching herein. Such genes, as 

5 well as those specifically described herein, separately or in combination, are useful in 
manipulating the plant MMR for plant breeding purposes. Thus, for example, the plant 
MMR may be altered by including in a plant cell a polynucleotide sequence as defined 
herein above with reference to the third embodiment of the invention, and causing the 
polynucleotide sequence to express either a polynucleotide which disables a plant MMR 

io gene, or a polypeptide which disrupts the plant's MMR system. 

The DNA molecule of the third embodiment of the invention includes a 
polynucleotide sequence (herein referred to as a MMR altering gene) which may for 
example encode sense, antisense or ribozyme molecules characterised by sufficient base 
sequence similarity or complementarity to the gene to be altered to permit the antisense or 
is ribozyme molecule to hybridise with the plant MMR gene in vivo or to permit the sense 
molecule to participate in co-suppression. Alternatively, the MMR altering gene may 
encode a protein or proteins which interfere with the activity of a plant MMR protein and 
thus disrupt the plant 1 s MMR system. For example, such encoded proteins n\ay be 
antibodies or other proteins capable of interfering with MMR protein function, such^as by 
20 complexing with a protein functionally involved in plant MMR thereby disrupting the 
MMR of the plant. An example of such a protein is the MSH3 protein of Arabidopsis 
thaliana described herein or a protein of another plant which is homologous to the MSH3 
protein of A. thaliana. For instance, overexpression of MSH3 in a plant cell causes 
MSH2 present in the cell to be substantially completely complexed, disrupting the 
25 mismatch repair mechanism or mechanisms in the cell which are functionally dependent 
on the presence of a complex of MSH2 with MSH6. Similarly, mismatch repair 
mechanisms which depend on the presence of a complex of MSH2 and MSH3 may be 
disrupted by the overexpression of MSH6. 

A chimeric gene of the fourth embodiment, incorporating a MMR altering gene, 
30 may be prepared by methods which are known in the art. Similarly, the MMR altering 
gene may be introduced into a plant cell, regenerating tissue or whole plant by techniques 
known in the an as being suitable for plant transformation, or by crossing. Known 
transformation techniques include Agrobacterium tumefaciens or A. rhizogenes mediated 
gene transfer, ballistic and chemical methods, and electroporation of protoplasts. 
35 The MMR altering gene or genes are typically expressed from suitable promoters. 
Suitable promoters may direct constitutive expression, such as the 35S or the NOS 
promoter. Usually, however, the promoter will direct either inducible or tissue specific 
(e.g. callus; embryonic tissue; etc.), cell type specific (e.g. protoplasts: meiocytes; etc.) 
or developmental (e.g. embryo) expression of the altering gene or genes, in order for the 
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MMR system to function in tissue types or cell types, or at developmental stages of the 
plant, in which it is not desirable for the MMR system to be altered. Using such 
promoters, therefore, the activity of a MMR altering gene may be limited to a specific 
stage during plant development or it may be altered by controlling conditions external to 
s the plant, and the deleterious effects of a permanently disabled or altered DNA mismatch 
repair system in a plant may be avoided. Examples of suitable promoters which are not 
constitutive are known in the art and include inducible promoters such as PRla. (reviewed 
by Gatz, 1997. Annual Rev. Plant Phys. Plant Mol. Biol. 48: 89), tissue specific 
promoters such as AoPRl (Sabahattin et al., 1993, Biotechnology 11: 218), and cell-type 
io specific promoters such as DMC\. 

A chimeric gene in accordance with the invention may further be physically linked 
to one or more selection markers such as genes which confer phenotypic traits such as 
herbicide resistance, antibiotic resistance or disease resistance, or which confer some 
other recognisable trait such as male sterility, male fertility, grain size, colour, growth 
is rate, flowering time, ripening time. etc. r 

The process of the tenth embodiment of the invention provides, for example, a 
process for generating intraspecies genetic variation by altering the mismatch repair 
system in a plant cell, in regenerating plant tissue or in a whole plant. The plant cell, 
regenerating tissue or whole plant includes and expresses one or more MMR altering 

20 genes which are capable of altering mismatch repair in the plant cell, regenerating tissue 
or whole plant. Alteration of MMR may be achieved, for example, by inactivating the 
genes encoding plant MSH3 and/or plant MSH6. It is preferred to inactivate the plant 
MSH3 and MSH6 encoding genes at the same time and in the same plant cell, 
regenerating tissue or whole plant. Typically in this preferred form of the invention 

25 inactivation of either plant MSH3 or MSH6 alone is insufficient to substantially alter the 
plant* s mismatch repair system and only when both MSH3 and MSH6 are inactivated 
simultaneously is the plant's mismatch repair system sufficiently altered to prevent the 
MMR system from recognising base pair mismatches, base insertions or deletions as a 
result of DNA replication errors. DNA damage, or oligonucleotide induced site-specific 

30 mutagenesis. However, in some applications of the invention, inactivation of only one 
gene may also be used to cause genomic instability or increase the efficiency of site- 
specific mutagenesis. 

If desired, the MMR altering gene or genes may later be rendered non-functional or 
ineffective, or may be removed from the genome of the plant cell, regenerating tissue or 
35 whole plant in order to restore mismatch repair in the plant cell, regenerating tissue or 
whole plant. The MMR altering gene or genes may be inactivated by means of known 
gene inactivation tools, such as ribozymes, or may be removed from (he genome using 
gene elimination systems known in the an, such as CRE/LOX. It is preferred to render 
two genes, whose gene products combine to incapacitate MMR, ineffecfive by separating 
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the altering genes through segregation. Therefore, in a preferred embodiment of the 
invention a first plant cell or plant is generated in which only plant MSH3 is incapacitated, 
and a second plant cell or plant is generated in which only plant MSH6 is incapacitated. 
The combination of both genomes, for example by crossing, then produces significant 
5 MMR deficiency in those cells or plants which have inherited both altering genes. If the 
alterine genes are expressed from unlinked loci, gene segregation restores MMR activity 
in the progeny of the cells or plants. 

In a process of the ninth embodiment of this invention, homeologous recombination 
is enhanced between different genomes, chromosomes or genes in plant cells or plants by 
10 altering MMR in said plant cells or plants. Such genomes, chromosomes or genes are 
characterised in that they originate from different plant families, genera, species, 
subspecies, plant varieties or lines. Hybrid plant cells or hybrid plants may be produced 
by crossing, by cell fusion or by other techniques known in the art. These plant cells or 
plants are further characterised by expressing one or more genes that are capable of 
15 altering mismatch repair in the plant cell or plants. 

In the process of the ninth embodiment, the homeologous recombination is typically 
for the purpose of introducing a desired characteristic in the hybrid plant. In this typical 
application of the process of the ninth embodiment, and in the process of the tenth 
embodiment the desired characteristic may be any characteristic which is of value to the 
20 plant breeder. Examples of such characteristics are well known in the art and include 
altered composition or quality of leaf or seed derived storage products (e.g. oil, starch, 
protein), altered composition or quality of cell walls (e.g. decrease in lignin content), 
altered growth rate, prolonged flowering, increased plant yield or grain yield, altered 
plant morphology, resistence to pathogens, tolerance to or improved performance under 
25 environmental stresses of various kinds, etc. 

In a preferred form of the tenth embodiment, an MMR altering gene is co- 
introduced along with the homeologous genome, chromosome or gene of another plant 
cell or plant into an MMR proficient plant cell or MMR proficient plant to produce a 
hybrid plant cell or hybrid plant in which homeologous recombination can occur. 
30 Suitably, the MMR proficient plant cell or MMR proficient plant may also include an 
MMR altering gene. For example a gene capable of inactivating plant MSH3 may be co- 
introduced along with the homeologous genome, chromosome or gene of another plant 
cell or plant into an MMR proficient plant cell or MMR proficient plant in which MSH6 
is inactivated. A resultant hybrid plant in which homeologous recombination occurs will 
35 include both the MSH3 and MSH6 altering genes and its MMR system will therefore be 

s 

inactivated. 

.* ■■» 

In this form of the invention; if hybrid plants are to be produced by crossing, the 
MMR altering gene preferably originates from the male parent, thus ensuring that the 
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MMR altering gene is always introduced and is not present in the recipient cell. That is, 
the MMR of the recipient cell, prior to introduction of the MMR altering gene, is 
typically proficient. Alternatively, if an MMR altering gene is present in a recipient cell it 
may be ineffective or inefficient on its own. or it may be linked to an inducible or tissue 
5 specific or cell type specific promoter which only renders the MMR altering gene active 
under limited conditions. 

Thus, in a preferred form of the process of the ninth embodiment, the MMR system 
of the hybrid plant is initially unaltered. In this form of the process, the step of altering 
the mismatch repair system may comprise introducing into the hybrid plant, or cells 
io thereof, a MMR altering gene, such as by Agrobacrerium twnefaciens or A. rhizogenes 
mediated gene transfer, ballistic and chemical methods, and electroporation of protoplasts. 

The MMR altering gene or genes are typically expressed from suitable promoters, as 
described above. Preferably, the promoter is transcriptionally active in mitotically and 
meiotically active tissue and/or cells to ensure MMR alteration after chromosome pairing 

15 at mitosis and meiosis, respectively. The preferred timing for MMR alteration is aft 
meiosis, because recombinant genomes, chromosomes or genes are directly transmitted to 
the progeny. A suitable meiocyte specific promoter is for example the DMC1 promoter 
from Arabidopsis thaliana ssp. Ler. (Klimyuk and Jones, 1997, Plant J. 11, 1-14). 
However, mitotic homeologous recombination is also a desirable outcome as somatic 

20 recombination events can be transmitted to offspring due to the totipotency of plant cells 
and the lack of predetermined germ cells in plants. 

If desired, the MMR altering gene or genes may later be rendered non-functional or 
ineffective, or may be removed from the hybrid plant or hybrid plant cells, in order to 
restore mismatch repair in the hybrid plant or hybrid plant cells. The MMR altering gene 
25 or genes may be inactivated by means of known gene inactivation tools as described 
herein above. 

EXAMPLES 

Example 1. Cloning of the AtMSH3 and AtMSH6 coiling sequences 

Isolation of partial AtMSH3 and AtMSH6 consen sus sequences 

30 Degenerate oligonucleotides UPMU (SEQ ID NO: 1 ) and DOMU (SEQ ID NO:2) 

UPMU CTGGATCCACIGGICCIAA(C/T)ATG 
DOMU CTGGATCC(A/G)TA(A/G)TGIGTI(A/G)C(A/G)AA 
were used to isolate AtMSH3 and AtMSH6 sequences by PCR amplification. 

Primers UPMU and DOMU correspond to conserved amino acid sequences of the 
35 proteins MutS (£L coli and S. typhimurium\ HexA (5. pneumoniae). Repl (mous^) and 
Duel (human). The conserved regions to which they are targeted are TGPNM for UPMU 
(amino acid positions 852-856 for AtMSH6 and 816-820 for AtMSH3) FATHY or FVTHY 
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tor DOMU f amino acid positions 964-968 for AtMSH6 and 928-932 tor AtMSH3. 
respectively.) These primers have been used to isolate MSH2 and MSHI from yeast 
(Reenan and Kolodner. Genetics 132: 963-973 ( 1992)) and MSH2 from Xenopus and mouse 
(Varlet et al,. Nuc. Acids Res. 22:5723-5728 (1994)). 
s Template single strand cDNA was produced by reverse transcription of 2 ug total 

RNA from a cell suspension culture of Arabidopsis thaliana ecotype Columbia (Axelos et 
al. 1989, Mol. Gen. Genetics 219: 106-112). The PCR reaction was performed under the 
following conditions in a final volume of lOOul: 0.2mM dNTP, luM each primer, 1XPCR 
buffer, lu Tag DNA polymerase (Appligene) in the presence of template cDNA. PCR 
10 parameters were 5 minutes at 94°C, followed by 30 cycles of 40 seconds at 95°C. 90 
seconds at 45°C, 1 minute at 72°C. The amplification products were cloned into pGEM-T 
vector (Promega) and sequenced. Two different clones were isolated, S5 (350bp) was 
homologous to MSH3, S8 (327bp) was homologous to MSH6. Complete cDNA sequences 
were then isolated according to the Marathon cDNA amplification kit procedure (Clontech). 
is In summary, this procedure involves producing double stranded cDNA by reverse 
transcription of 2ug polyA^ RNA from the cell suspension culture of Arabidopsis* 
Adaptors are ligated on each side of the cDNA. The ligated cDNA is used as a template for 
5* and 3' RACE PCR reactions in the presence of primers that are specific for the adaptor on 
one side (API and AP2), and specific for the targeted gene on the other side. A 5/. and a 3* 
20 fragment that overlap are thus produced for each gene. The complete gene coding sequence 
can be reconstituted taking advantage of a unique restriction site, if available, in the 
overlapping region. Specific details of this procedure as it was used to isolate AtMSH3 and 
AtMSH6 coding regions, are as follows. 
Isolation of At MS H 3 complete co ding sequence 

25 From the sequence of clone S5, primer 636 (SEQ ID NO:3) was designed: 

636 TGCTAGTGCCTCTTGCAAGCTCAT. 
Primer API (SEQ ID NO:4) is complementary to a portion of an adaptor sequence 
which had been ligated to the 5' and 3' ends of Arabidopsis cDNA: 

API CCATCCTAATACGACTCACTATAGGGC. 
30 PCR performed on the ligated cDN A with primers 636 and API for the 5 ' RACE PCR 
was followed by a second round of amplification with the nested primers AP2 (SEQ ID 

NO:5) and S525 (SEQ ID NO:6) 

AP2 ACTCACTATAGGGCTCGAGCGGC 
S525 AGGTTCTGATTATGTGTGACGCTTTACTTA 
35 (the latter was also designed to correspond to a part of the sequence of clone S5) and 
produced a 2720bp DNA fragment. Figure 1 provides a diagrammatic representation of the 
primer sequences used to isolate AtMSH3. Another primer (S5 1 , SEQ ID NO:7) 
S5I GGATCGGGTACTGGGTTTTGAGTGTGAGG 
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was designed closer to the 5* border and permitted the determination of 99bp upstream to 
the Aid initiation eodon. For the 3* RACE PCR. a first PCR reaction was performed with 
primers API and 635 (SEQ ID N():8). 

635 GCACGTGCTTGATGGTGTTTTCAC 
5 followed by a second round of amplification, using the nested primers AP2 and S523 (SEQ 
IDNO:9) 

S523 TCAGACAGTATCCAGCATGGCAGAAGTA 
which produced a DNA fragment of 890bp. Both DNA fragments were subcloned into 
pGEM-T and sequenced. Since PCR amplification using the Expand Long Template PCR 
io System (Boehringer-Mannheim) produced errors in the sequence, new oligonucleotides 
were designed to isolate those sequences again by PCR. but with the high fidelity DNA 
polymerase Pfu. PCR with primers 1S5 (SEQ ID NO: 10) and S53 (SEQ ID NO: 1 1) 
I S5 ATCCCGGGATGGGCAAGCAAAAGCAGCAGACGA 
S53 GACAAAGAGCGAAATGAGGCCCCTTGG 
is amplified the 1244bp fragment clone 52 (SEQ ID NO:12, cloned into pUCl 8/Smal). PCR 
with primers S52 (SEQ ID NO: 1 3 ) and 2S5 (SEQ ID NO: 14) * 
2S5 ATCCCGGGTCAAAATGAACAAGTTGGTTTTAGTC 
S52 GCCACATCTGACTGTTCAAGCCCTCGC 
amplified the 2l04bp clone 13 (SEQ ID NO: 15, cloned into pUC18/Smal). The complete 
20 coding sequence of the ArMSH3 gene was reconstructed in pUC18 by ligating the J' half 
of AtMSH3 (clone 52) to the 3' half of AtMSH3 (clone 13) after digesting with BamHl 
which has a unique cleavage site in the overlapping region of both clones. This 
manipulation yielded plasmid pPF26. The Smal fragment from pPF26 contains the 
complete ArMSH3 coding sequence. The remaining primers referred to in Figure 1 are as 
25 follows: 

S5 1 GGATCGGGTACTGGGTTTTGAGTGTGAGG (SEQ ID NO: 16) 

S525 - AGGTTCTGATTATGTGTGACGCTTTACTTA (SEQ ID NO: 17) 
Figures 2 and 3 provide plasmid maps of clones 52 and 13 respectively, showing 
restriction enzyme cleavage sites. The complete AtMSH3 coding sequence (SEQ ID NO: 18) 
30 is 3246bp long and is shown in Figure 4 together with the deduced sequence (SEQ ID 
NO: 19) of the encoded polypeptide. AiMSH3 is clearly homologous to the yeast and mouse 
MSH3 genes. A sequence alignment of polypeptides encoded by AtMSH3 and that encoded 
by Saccharomyces cerevisiae MSH3 is set out in Figure 5. 

Isolation of the AtMSH6 complete coding sequence and genomic sequences 

35 The same procedure allowed isolation of the AtMSH6 cDNA. Figure 6 provides a 
diagrammatic representation of the primer sequences used to isolate AtMSH6. For the 5* 
RACE PCR, primers 638 (SEQ ID NO:20) and API (SEQ ID NO:4) 

638 TCTCTACC AGGTGACG AAA AACCG 

allowed the amplification of a 2889 DNA fragment. Primer S81 (SEQ ID NO:21) 
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S8 1 CGTCGCCTTTAGC ATCCCCTTCCTTCAC 
helped define the 142bp upstream to the ATG initiation codon. On the 3' side. RACE PCR 
was initially performed with primers S823 (SEQ ID NO:22) and API (SEQ ID NO:4) . 

S823 GCTTGGCGCATCTAATAGAATCATGACAGG 
5 and then with the nested primers 637 (SEQ ID N'0:23) and AP2 (SEQ ID NO:5). 

637 GACAGCGTCAGTTCTTCAGAATGC 
to produce a 774bp DNA fragment. As for AtMSH3, those fragments were cloned and 
sequenced. Re-isolation of the DNA sequence using the high fidelity Pfu polymerase and 
newly designed primers 1S8 (SEQ ID NO:24) and S83 (SEQ ID NO:25) (for the 5* side) led 
10 to a 2182 bp DNA fragment identified as clone 43 (SEQ ID NO:26. cloned in 
pUCl8/Smal). and a 1379bp clone identified as clone 62 (SEQ ID NO:27, also cloned in 
pUC18/Smal). 

1 S8 ATCCCGGG ATGCAGCGCCAGAGATCG ATTTTGT 

2S8 ATCCCGGGTTATTTGGGAACACAGTAAGAGGATT (SEQ ID 

15 NO:28) 

582 GCGTTCGATCATCAGCCTCTGTGTTGC (SEQ ID NO:29) » 

583 CGCTATCTATGGCTGCTTCGAATTGAG ~ 
Figures 7 and 8 provide plasmid maps of clones 43 and 62 respectively, showing restriction 
enzyme cleavage sites. Clones 43 and 62 were digested by the Xmn\ restriction enzyme for 

20 which a unique site is present in their overlapping region and then Heated. The complete 
AtMSH6 coding sequence (SEQ ID NO:30) is 3330bp long and is shown in Figure 9 
together with the deduced sequence (SEQ ID NO.31) of the encoded polypeptide. AtMSH6 
is clearly homologous to the yeast and mouse .WStfogenes. A sequence alignment of 
polypeptides encoded by AtMSH6 and that encoded by Saccharomyces cerevisiae MSH6 is 

25 set out in Figure 10. 

An A(MSH6 genomic sequence was also isolated from a genomic DNA library 
constituted after partial Sau3Al digestion of DNA from the Arabidopsis cell suspension. 
8062bp were sequenced that covered the AtMSH6 gene and show colinearity with the 
cDNA. 16 introns are found scattered along the gene. The complete genomic sequence 

30 (SEQ ID NO:98) is shown in Figure 11. 

Example 2. A measure of somatic variation in MMR deficient plants 
Constructs 

Constructs with antisense AtMSH3 or antisense AtMSH6 or both AtMSH3/AtMSH6 
under the control of a single 35S promoter have been inserted into the binary vector 
35 pPZP121 (Hajdukiewicz et al.. Plant Mol. Biol. 23. 793-799) between the right and left 
borders of the T-DNA. The pPZP121 plasmid confers chloramphenicol resistance to 
Escherichia coli or Agrobacterium tumefaciens bacteria. The aacCl gene is carried by the 
T-DNA and allows selection of transformed plant cells on gentamycin (Hajdukiewicz et al., 
Plant Mol. Biol. 25. 989-994). For the purpose of expressing antisense constructs, a 35S 
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promoter/terminator cassette with a polyiinker was introduced into pPZPl 2 1 . The 3' ends of 
the considered genes have been chosen since this region seems more efficient for antisense 
inhibition. For AtMSH3 this corresponds to clone 13 (2104bp). for A/MSH6 this 
corresponds to clone 62 (1379bp). Clone 13 comprises 2104bp of the 3* region that were cut 

5 off the pUCl 8 vector by Sall/Sstl restriction, blunted with T4 DNA polymerase and ligated 
into the T4 DNA polymerase blunted BamHl site of pPZP121/35S, creating clone pPFl3. 
The same procedure was followed for the 3* region of AtMSHb clone 62 (1379bp) thus 
creating plasmid pPF14. For the double constructs, the 3 # region (from clone 62) of 
AtMSH6 was introduced ahead of the AtMSHl region into pPF13 creating pCW186 and 

10 reciprocally, the 3' region of AtMSH3 (from clone 13) was introduced ahead of AtMSH6 into 
pPF14, creating pCWl 87. 

These constructs were introduced into the Arabidopsis cells (as described below) of 
wildtype Columbia and of the Columbia tester line. 

An alternative strategy to antisense inhibition of AtMSH6 comes from experiments of 
is Marra et ah (1998. Proc. Natl. Acad. Sci USA 95. 8568-8573) who show that 
overexpression of functional MSH3 results in depletion of MSH6 protein in human cells. 
This depletion may generate a mismatch repair mutant phenotype. 

For the purpose of overexpressing functional At\dSH3 protein in plant cells, the 
complete MSH3 coding region was excised from pPF26 (example 1) by digestion with 

20 Smal, and was inserted into the Smal site of pCW164. The resulting construct was named 
pPF66. It contains a complete A(MSH3 gene under the control of the 35S promoter inside 
the left (LB) and right (RB) border of the T-DNA. This T-DNA also contains the hp(2 gene 
for gentamycin selection. Plasmid pPF66 was introduced into Arabidopsis cells as 
described below. One cell clone was selected which clearly overexpressed the AtMSH3 

25 gene as shown by Northern analysis. Figures 12-16 provide plasmid maps of plasmids 
pPF13, pPF!4, pCW186, pCW187 and pPF66. respectively. 

Construction of tester construct 

For the purpose of Forward Mutagenesis Assay s, a tester construct was. built 
containing the coding regions for nptll, codA. uidA. All three genes are driven by the 35S 

30 promoter and are terminated by the 35S terminator. This construct was obtained by 
introducing an EcoRl fragment encoding the codA cassette (2.5kb) and a Hindlll fragment 
encoding the uidA (GUS) cassette (2.4kb) into the pPZPlll vector (Hajdukiewicz et 
al.,1994, Plant Mol Biol 23: 793-799) which already contained the nptll expression cassette. 
This new plasmid was named pPF57. Nptll is used to select for transformed plant cells. 

35 GUS is used to analyse the degree of gene silencing in the construct (i.e. to identify cell 
lines in which the transgenes are expressed)* and cod A is used as a marker for forward 
mutagenesis (described below). 
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The plasmid map of pPF57 is provided in Figure 17. 
[Mam transformation 

The constructs are introduced into Agrohucterium by eleetroporation. Plant cells are 
then transformed by co-cuitivation. A suspension culture of Arabidopsis thaliana cells that 

s has been established by Axelos et al. (1992, Plant Physiol. Biochem. 30, 1-6) may be used. 
One day old freshly subcultured cells are diluted five times into AT medium (Gamborg B5 
medium. 30g/l sucrose, 200jig/l NAA). \0\x\ of saturated Agrobacterium containing the 
transforming T-DNA constructs are added to 10ml diluted cells in a 100ml erlenmeyer. The 
co-cultivation is agitated slowly (80rpm) for 2 days. The cells are then washed 3 times into 

10 AT medium and finally resuspended in the same initial volume (10ml). The culture is 
agitated for 3 days to allow expression before plating on selection plates (AT/BactoAgar 
8g/l+gentamycin 50ng/ml). Transformed individual caili are isolated 3 weeks later. 

Tester Strain 

The tester construct on plasmid pPF57 was introduced into Arabidopsis cells of 
is wildtvpe Columbia using the transformation protocol described above. Among 10 candidate 
transformants. one cell clone was shown (by Southern analysis) to have a unique t-DNA 
insertion. All three genes were shown to be functional in this cell line as indicated by 
resistance to kanamycin, blue staining in the presence of X-Glu (GUS). and sensitivity to 
5-fluoro-cytosine (codA). 
20 MMR altering genes (described above) were then introduced individually into the 

tester line and transformed cells are used for analysis of both Microsatellite Instability and 
Forward Mutagenesis. 
Microsatellite analysis 

Microsatellites have been described in Arabidopsis (Bell and Ecker, 1994, Genomics 
25 19. 137-144). The present Example is based on a study of instability of microsatellites that 
are polymorphic amongst different ecotypes. DNA is extracted from the transformed calli. 
Specific primers have been defined that are used to amplify the microsatellite sequence. 
One of the two primers is previously P 32 labelled by T4 kinase. In case of a polymorphic 
variation, new PCR products appear that do not follow the expected pattern of migration on 
30 a polyacrylamide gel. This is a commonly observed feature for MMR deficient cells in 
yeast or mammalian cells. 

In particular, the present Example describes a study on microsatellites ca72 (CA| 8 ), 
ngal72 (GA 29 ). and ATHGEN E A( A 39 ), chosen because they belong to the types 
predominantly affected in human mismatch repair deficient tumors. The size of these 
35 microsatellites is not conserved from one Arabidopsis ecotype to the other. 

Arabidopsis cells which are transformed with an MMR altering gene (above) and 
control cells not expressing the MMR altering gene are allowed to form calli. DNA is 
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rapidly extracted from the ealli and is analysed for microsatellite instability as described in 
detail bv Bell and Ecker 1994. Genomics 19. I3 7-U4. In summary, the relevant 
microsatellite is amplified by PCR using P32 labelled primers. The PCR products are 
separated on a DNA sequencing gel for s.ze determination. Size differences between 
5 microsatellites from transformed and control cells not expressing the MMR altering gene in 
question indicate microsatellite instability as a result of MMR alteration. 

The sequences of primers used for PCR amplification of microsatellites ca72 and 
ngai 72 are included in Table I . PCR amplification of microsatellite ATHGENEA made use 
of a forward primer containing the sequence 

1 o ACCATGCATAGCTTAAACTTCTTG (SEQ ID NO:32) 

and of a reverse primer containing the sequence 

ACATAACCACAAATAGGGGTGC (SEQ ID NO:33). 

The amplification for microsatellite ca72 revealed in Columbia control cells (with 
respect to the MMR altering gene) a 248 bp long PCR fragment instead of the published 
is length of 124 bp. DNA sequencing verified this fragment as a CA,„ microsatellite. 
Forward mutagenesis assay 

Tester cells transformed with antisense AtMSHS or antisense AtMSH6 or both 
AtMSH3/AtMSH6 are analysed for the stability of the codA gene. The functional codA gene 
confers to sensitivity to 5-fluoro-cytosine (5FC). whereas a gene inactivating mutation in 

20 codA will confer resistance to 5FC. The frequency of resistant cells is therefore a good 
indicator of somatic variation as a direct result of MMR alteration. Variants resistant to 5FC 
are first analysed for GUS activity. If GUS is inactive. 5FC resistance is assumed to be due 
to gene silencing (all three genes are under the 35S promoter). If GUS is active. 5FC 
resistance is assumed to be due to forward mutations that have inactivated codA. PCR is 

25 then performed on the putative codA mutant genes which is then sequenced to confirm the 
presence of forward mutations in codA. 

Besides codA. other marker genes may also be used for the Forward Mutagenesis 
Assay such as the ALS gene (conferring sensitivity to valine or to sulfonylurea; Hervieu and 
Vaucheret, 1996, Mol. Gen. Genet. 251 220-224: Mazur et al. 1987, Plant Physiol. 85 1 1 10- 
30 1117). 

Example 3. Homeologous meiotic recombination in Arabidopsis thaliana 

^ Construction of a meioevte speci fic gene expression cassette comprising the DMCl 

promoter and the NOS terminator 

(i) The DMCl promoter may be used as published by Klimyuk and Jones, 1997, 
35 Plant J. 11.1-14). To obtain a more convenient alternative for gene cloning, a 3.3 Kb 
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long subfragment of the DMC\ promoter was obtained by PCR from genomic DNA of 
Arabidopsis thaliana (ssp. Landsberg erecta "Z.er "). 
The PCR was done in three rounds: 

Round One: A 3.7 Kb long product was obtained using the forward primer 
5 DMC1N-A comprising the sequence 

GAAGCGATATTGTTCGTG (SEQ ID NO: 34) 
and the reverse primer DMCIN-B comprising the sequence 

AGATTGCGAGAACATTCC (SEQ ID NO:35). 
The weak amplification product was then used as template for round two and three. 
io Round Two: A 3.1 Kb long product comprising the promoter and the 5' 

untranslated leader was obtained using forward primer DMCIN-L which contained the 
sequence 

acgcgtcgacTCAGCTATGAGATTACTCGTG (SEQ ID NO:36) 
and introduced a Sail cloning site at the 5" end of the promoter fragment . and reverse 
15 primer DMCIN-2 which contained the sequence 

gctctagaTTTCTCGCTCTAAGACTCTCT (SEQ ID NO: 37) , 
and introduced a Xbal site at the 3" end of the PCR fragment. 

Round Three: A 0.2 Kb long product comprising the first exon/intron of the DMC\ 
promoter was obtained using forward primer DMCIN-3, which contained the sequence 
20 gctctagaGCTTCTCTTAAGTAAGTGATTGAT (SEQ ID NO:38) 

and introduced a Xbal site at the 5' end of the PCR fragment, and reverse primer 
DMCIN-4, containing the sequence 

tcccccgggctcgagagatctccatggTTTCTTCAGCTCTATGAATCC (SEQ ID NO:39) 
and introduced at the 3' end of the PCR product restriction sites for NcoL Bglll. Xhol and 
25 Smal. 

The products obtained in round Two and Three were digested with Xbal and 
subsequently ligated to reconstitute a 3.3 Kb long DMCX promoter from which the first 
two in-frame ATG start codons were replaced with a unique restriction site for Xbal. 
This promoter can be cloned between the restriction sites for Sail and Smal of p3264, 
30 which contains the Sacl-EcoKl NOS terminator in pBIN19, to yield the entire expression 
cassette in pBIN19. This cassette contains the following cloning sites: Ncol, Bgllh Xhol. 
Smal and (already present on p3264) Kpnl and Sad. 

(ii) Another strategy yielded the following convenient DMC1 promoter. A 1.8 kb 
DNA fragment comprising the 3* terminal pan of the meiocyte specific DMC1 promoter 
35 was isolated by PCR from purified genomic DNA of Arabidopsis thaliana (ssp. Landsberg 
erecta n Ler"). The forward PCR primer (DMCla) contained the sequence 
acgcgtcgacGAATTCGCAAGTGGGG (SEQ ID NO:40) 
and introduced a Sail cloning site at the 5 ' end of the promoter fragment. The reverse 
PCR primer (DMClb) contained the sequence 
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iccatggagatctcccgggtacCGATTTGCTTCGAGGG (SEQ ID NO:4l) 
introducing a poiylinker region at the 3' end of the promoter fragment. The PCR reaction 
was carried out with VENT DNA Polymerase (NEB) over 25 cycles using the following 
cycling protocol: 1 minute at 94°C, I minute at 54°C, 2 minutes at 72°C. 

5 The PCR reaction yielded a blunt ended DNA fragment which was digested with 
restriction enzyme Sail and was cloned into the cleavage sites of restriction enzymes Sail 
and Smal in plasmid p2030, a pUC118 derivative containing the Sacl-EcoRl NOS 
terminator fragment of pBIN121. The cloning yielded plasmid p2031, containing the 
DMCl promoter-poly linker-NOS terminator expression cassette depicted in Figure 18. 

to B. Construction of an MSH3 antisense gene under the control of the DMCl promoter 

A 2.1 kb DNA fragment encoding the carboxy terminal pan of AtMSH3 was 
removed from the poiylinker of clone 13 described in Example 1 after (i) digestion with 
Kpnl. (ii) blunting of the DNA ends generated by Kpnl and (iii) digestion with BarnHl. 
The isolated fragment was then cloned in antisense orientation downstream of the DMQ 
is promoter in plasmid p203L which had been digested with restriction enzymes Smal and 
Bglll. This cloning yielded plasmid p2033 (Figure 18). 

After digestion of p2033 with EcoRl, a 4.1 kb DNA fragment was recovered 
comprising the DMCl promoter, the partial MSH3 cDNA in antisense orientation with 
respect to the promoter and the NOS terminator. This fragment was cloned into the EcoRI 
20 restriction site of plant transformation vector pNOS-Hyg-SCV to yield plasmid p3242 
(Figure 18). 

C. Construction of a combined MSH6/MSH3 antisense gene under the control of a 
single DMCl promoter 

A 3.1 kb fragment, encoding in antisense orientation the partial AtMSH6 and AtMSH3 
25 sequences and the 35S terminator, was isolated from pCW186 by digestion with Kpnl. 
The fragment was treated with Klenow enzyme to blunt both ends. It was then cloned into 
the Smal site of plasmid p3243 (a pNOS-Hyg-SCV derivative, illustrated in Figure 19), 
which had been opened to delete the region between the Smal sites. Clones containing the 
fragment in the antisense orientation with respect to the DMCl promoter (described in 
30 A(ii) above) were identified by diagnostic digestion with BarnHl. The selected construct 
was named p3261. 

Another practical way of cloning the double antisense gene is as follows. A 1 kb 
DNA fragment encoding the carboxyterminal part of AtMSH6 is isolated from clone 62 
described in Example 1 after digestion of clone 62 plasmid DNA with BamHh which 
35 cleaves in the 5' poiylinker region flanking the partial cDNA, and with £coRI, which 
cleaves within the cDNA. The isolated fragment is treated with Klenow enzyme to blunt 
both its ends and is cloned into the recipient plasmid p2033 or p3242. For the purpose of 
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cloning, the recipient plasmid may be cleaved with either Aval or Nco\ and can be blunted 
with Klenow enzyme to produce blunt acceptor ends for fragment cloning. This cloning 
yields two opposite orientations of cloned fragment DNA with respect to the DMCl 
promoter. These can be identified by diagnostic digestion with restriction enzymes Seal 

5 or Xmnl in conjunction with Sacl. The selected construct contains the DMCl promoter, 
the combined partial cDNAs for AtMSH3 and AtMSH6 (both cloned in antisense 
orientation with respect to the DMCl promoter) and the NOS terminator. If the recipient 
plasmid is p2033, the combined antisense gene under control the single DMCl promoter is 
recovered from the vector after £coRI digestion and is cloned into the EcoRI restriction 

10 site of pNOS-Hyg-SCV. 

D. Construction of a full-length MSH3 sense gene under control of the DMCl promoter 
for overexpression of functional MSH3 protein 

Overexpression of MSH3 protein was shown in human cells (Marra et al., 1998, 
Proc. Natl. Acad. Sci. USA 95, 8568-8573) to complex all available MSH2 protein. This 
is leaves MSH6 protein without its partner, leading to the degradation of MSH6 protein aqjd 
eventually to a mismatch repair phenotype. 

This phenomenon is exploited to increase homeologous meiotic recombination in 
Arabidopsis as an alternative to antisense inhibition of AtMSH6. For this purpose the full- 
length cDNA encoding AtMSHZ is isolated from plasmid pPF66 by digestion with Smal 
20 and is cloned into the Smal site of the DAfCl expression cassettes described in A(i). 

E. Selection of Recombination markers on homeologous chromosomes o f Arabidopsis 
rhaliana subspecies Landsberv erecta (Ler). Columbia (Cati and C24. respectively 

E(i). Visual recombination markers in Arabidopsis th. subspecies C24: 

Agrobacterium mediated transformation with a T-DNA containing a 35S-GUS gene, 
25 inactivated by insertion of a 35S-Ac transposable element (Finnegan et al., 1993, Plant 
Moi. Biol. 22, 625-633), had yielded a C24 line in which the T-DNA construct was 
integrated into chromosome 2. Genetic and molecular analysis of this line shows that the 
Ac transposon had excised from its T-DNA locus thereby restoring CUS activity, but had 
re-inserted into the chromosome at a distance of 16.4 cM, where it stayed fixed (due to 
30 disablement of Ac) within the chlorina gene. Insertional inactivation of the chlorina gene 
caused a bleached phenotype in those plants that were homozygous for this mutation. 
Because of the two linked phenotypic markers, chlorina3A:Ac and GUS. this C24 line was 
used in crosses to wildtype Ler for analysis of meiotic homeologous recombination on 
chromosome 2 in conjunction with molecular recombination markers. 

35 Ef HV Visual recom bination markers in Arabidopsis th. Ler: 

The Ler line NW1 (obtained from NASC, Nottingham, UK) contains one recessive 
visual marker per chromosome, i.e. an-1 on Chr.l, py-1 on Chr.2, gll-1 on Chr.3, cer2-l 
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on Chr.4. and msl-1 on Chr.5. This line is used in crosses to wildtype C24 which 
expresses an MMR altering gene tor analysis of meiotic homeologous recombination on 
chromosomes 1-5 in conjunction with molecular recombination markers listed in Table 1. 

Other Ler lines from NASC have several visual markers in close proximity to each 
5 other on the same chromosome. When these lines are used for hybrid production, analysis 
of homeologous meiotic recombination may make use entirely of visual recombination 
markers. Particularly suitable for crossing to C24 wildtype that is expressing a MMR 
altering gene are the following Ler lines: 

NW22: relative markers are disl - (4 cM) - ga4 - (1 1 cM) - thl on chromosome 1. 
,o NW10: relevant markers are tz-201 - (5 cM) - cer3 on chromosome 5. 
NW134, relevant markers are ttg - (4 cM) - ga3 on chromosome 5. 
NW24 (abi3-l) and NW64 (gll-\). When present in the same plant on chromosome 
3. abi3-\ and gll-l are 13 cM apart. Since this marker combination is not available from 
NASC. we have combined these markers by crossing line NW24 to line NW64. The.Fl 
, 5 offspring were allowed to self-fertilise and to produce F2 seeds. F2 Plants which carry 
both markers as homozygous traits on the same chromosome can be identified firstly, by 
germinating F2 seeds on germination medium containing selective concentrations of 
abscisic acid, and subsequently, by identifying among the abscisic acid resistant plants 
those individuals which show the glabra phenotype. 
20 E(iirt Molecular recombination marke rs in Col. Ler and C24: 

The genome of Arabidopsis thaliana is interspersed with unique base sequences 
arranged as simple tandem repeats. Allelic repeats can vary in length between different 
Arabidopsis subspecies and when amplified by PCR yield diagnostic DNA products of 
different length named Simple Sequence Length Polymorphisms (SSLPs). Many SSLPs 
25 have been genetically mapped and have been assigned to unique chromosome locations on 
the recombinant inbred map (Bell and Ecker. 1994, Genomics 19, 137-144. Lister and 
Deans lines. Weeds World 4i, May 1997). 

In Table 1 are listed 28 mapped and established SSLPs between Ler and Col. A 
number of PCR primer pairs are described herein (SEQ ID NO:42 to SEQ ID NO:97) 
30 which also yielded SSLPs between C24 and Ler (19 SSLPs) or between C24 and Col (25 
SSLPs), respectively. Polymorphic SSLPs can be used as molecular markers in the 
analysis of homeologous recombination between genomes from these subspecies. 

The PCR reactions (25 uL) were carried out over 35 cycles (15 seconds at 94°C, 30 
seconds at 55'C and 30 seconds at 72»C), with 0.25 U Taq DNA polymerase and 0.6 M g 
35 genomic DNA in reaction buffer containing 2 mM MgCl 2 . PCR products were separated 
r by agarose gel electrophoresis (4% ultra high resolution agarose) and visualised by 
' ethidiumbromide staining. The results from the PCR experiments are summarised in 
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Table I, which also shows the sequence of PCR primers, primer annealing temperature 

(Tm). PCR product length and chromosome location of SSLP (with respect to the RI map 

of May 1997, Weeds World 4i). 

F Production of hybrid plants 
5 C24 plants heterozygous for chlorina3A:AclGUS are crossed as male to emasculated 

wildtype Ler to produce Ler/C24(chlorina3A, GUS) hybrid seeds. 

Due to the heterozygosity of the C24 parent, only 50 % of hybrid plants have 

inherited the chlorina3A:Ac/GUS locus. The remaining 50% of hybrid plants are wildtype 

with respect to chlorina3A:Ac/GUS. Since the mutant locus is linked to a kanamycin 
10 resistance gene (contained on the same T-DNA as GUS) mutant plants can be pre-selected 

by germinating hybrid seeds on germination medium containing 50 mg/L kanamycin. 

Ler plants homozygous for the five chromosome markers are male sterile (msl-1) 

and are crossed without emasculation to wildtype C24 to produce Lerian-l. py-l. gll-l. 

cer2-l. msl-l)lC24 hybrid seeds. 
,5 Other Ler plants, which are male fertile, are crossed after emasculation of the female 

parent to produce LerlC24 hybrid seeds. 

r. Introduction of MSH3 and MSH6/3 antisense genes into Arabidonsis and analysis of 
meiotic homeologous recom bination 

(i) Transformation of hybrid plants and analysis of h omeologous meiotic recombination 

20 The plant transformation vectors comprising the antisense genes described in (B) and 
(C) above are introduced into Agrobacterium tumefaciens strain AGLl (Lazo et al.. 1991, 
Bio/Technology 9, 963-967) by electroporation. Recombinant Agrobacterium clones are 
selected on LB medium containing 50 mg/L rifampicin and 100 mg/L carbenicillin. 
Selected clones are used to infect roots of Arabidopsis hybrid plants (described in (F) 

25 above) using the root transformation protocol of Valvekens et al. (1988, PNAS 85. 5536- 
5540) except that shoot and root inducing media contain hygromycin (10 mg/L) instead of 
kanamycin. 

Plants regenerated from roots of hybrid plants are genetic clones of root donating 
plants and therefore are again genetic hybrids of two Arabidopsis subspecies described in 
30 (F). However, in contrast to the root donating plants, the regenerated hybrid plants also 
contain the introduced transgene and the co-introduced hygromycin resistance gene with 
the latter allowing these plants to grow on hygromycin containing culture medium. 

Hygromycin resistant plants are then allowed to enter the reproductive phase and to 
produce gametes by meiotic divisions of microspore and megaspore mothercells. At 
3S meiosis, the DMC\ promoter is activated and can direct the expression of antisense genes 
described in (B) and (C) above, leading to decreased MSH6 and/or MSHS gene 
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expression. This in turn depletes the gamete mothercells of MSH6 and/or MSH3 protein, 
thus causing alteration of MMR during meiotic divisions and increasing the recombination 
frequency between homeologous chromosomes. 

Transgenic plants are then allowed to self- fertilise and to produce seeds. These 
5 seeds (F2 seeds with respect to hybrid production), and the plants derived therefrom, 
carry the homeologous recombination events which can be identified by using the visual 
and molecular recombination markers described in (E) above. 

In case of homeologous recombination between chromosomes of Ler and 
C24(chlorina3A:Ac, GUS). the analysis concentrates on chromosome 2 by selecting plants 

io showing the visual phenotypic marker chlorina. This marker thus serves as a reference 
point as it indicates that respective chromosomes 2 originate from C24. Other markers, 
such as GUS or molecular markers, on chromosome 2 may then be used to identify 
chromosomal regions which are derived from the Ler chromosome as a result of 
homeologous recombination. F2 plants of control transfonnants not expressing the 

1 5 antisense gene(s) can be analysed in parallel and the results can be used for comparison *o 
homeologous recombination results obtained in antisense plants. 

(ii) Transformation of C24 wildtvpe. hvbrid plant production and analysis of homeologous 
meiotic recombination 

Introduction of MMR altering genes into wildtype C24 is done using the root 
20 transformation protocol as described in G(i) for transformation of hybrid plants. 
Transformed plants are selected by resistance to either 10 mg/L hygromycin (in case of 
transformation with T-DNA's derived from pNOS-Hyg-SCV) or to 50 mg/L kanamycin 
(in case of transformation with T-DNA's derived from pBIN19). 

Transgenic plants are then allowed to self-fertilise and to produce seeds (Tl seeds). 

25 Segregation of the antibiotic resistance gene in the Tl population then indicates the 
number of transgene toci. Lines with a single transgene locus (indicated by a 3:1 ratio of 
resistantrsensitive plants) are selected and are bred to homozygosity. This is done by 
collecting selfed seeds (T2) from Tl plants and subsequent testing of at least four 
independent T2 seed populations for segregation of the antibiotic resistance gene. T2 

30 populations which do not segregate the antibiotic resistance gene are assumed to be 
homozygous for both the resistance gene and the linked MMR altering gene. 

C24 plants homozygous for the MMR altering gene are then crossed to Ler lines 
homozygous for recessive visual markers (see E(ii)) to produce C24/Ler hybrid plants as 
described in (F). These Fl hybrids are then allowed to enter the reproductive phase and to 
35 produce gametes by meiotic division of microspore and megaspore mothercells. At 
meiosis. the DMC 1 promoter is activated and can direct the expression of antisense or 
sense genes described in (B) t (C) and (D) above, leading to decreased MSH6 and/or MSH3 
gene expression. This in turn depletes the gamete mothercells of MSH6 and/or MSH3 
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protein, thus causing alteration of MMR during meiotic divisions and increasing the 
recombination frequency between the homeologous chromosomes of C24 and Ler. 
Recombination events are then scored in the F2 generation. 

For recombination analysis, the hybrid plants are allowed to self- fertilise and to 
5 produce F2 seeds. F2 plants are then preselected for a first visual marker. Since this 
marker is recessive, its visual presence indicates homozygosity for Ler DNA at the 
relevant locus. Those F2 plants which show this first visual marker are then analysed for 
the presence or absence of a second visual marker which in the Ler parent is closely linked 
to the first marker. Absence of the second visual marker indicates recombination between 
io the relevant C24 and Ler chromosomes between the first and second marker. The 
frequency of recombination in transgenic hybrids is compared to the recombination 
frequency in control hybrids not expressing the MMR altering gene. 

Examples of recombination analysis are the following. 

The Ler line NW22(d/s/. ga4 . thl) is used for crosses to transformedC24. 

is F2 plants are preselected first for thiamine requirement (thl) and then are further 

analysed for re-appearance of wildtype height (loss of ga4) and/or re-appearance of 
normal trichomes (loss of disl) as a result of recombination. ^: 

The Ler line NW10(c-20/, cer3 ) is used for crosses to transformedC24. 

F2 plants are then preselected first for thiazole requirement (tz) and then are further 
20 analysed for re-appearance of normal, i.e. non-shiny stems (loss of cer3) as a result of 
recombination. 

The Ler line NW134 (ttg , ga3 ) is used for crosses to transformedC24. F2 plants 
are first preselected for dwarfish appearance (ga3) and are then analysed for re-appearance 
of trichomes (loss of ttg) as a result of recombination. 

25 Ler plants homozygous for abi3-l and gli-l are used for crosses to transformed C24. 
F2 plants are first preselected for their ability to germinate in the presence of abscisic acid 
and are then analysed for re-appearance of trichomes on the leaves (loss of g/J-l) as a 
result of recombination. 

In the case of homeologous recombination between transformed C24 and the Ler line 
30 NW1 (a/i-7, py-l 9 gll-1. cer2-l, mj/-l), recombination analysis is similar the one 
described above, except that the second marker is not a visual marker but has to be a 
molecular marker. This is because the Ler parent carries only one visual marker per 
chromosome. 
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CLAIMS 

I. An isolated and purified DNA molecule comprising a polynucleotide sequence 
encoding a polypeptide functionally involved in the DNA mismatch repair system of a 
plant. 

5 2. A DNA molecule according to claim 1 wherein said polypeptide is 
homologous to a mismatch repair polypeptide of a yeast or of a human. 

3. A DNA molecule according to claim 1 wherein said polypeptide is 
homologous to AtMSH3 (SEQ ID NO: 19) or to AtMSH6 (SEQ ID NO: 31). 

4. An isolated and purified polypeptide functionally involved in the DNA 
io mismatch repair system of a plant. 

5. A polypeptide according to claim 4 which is homologous to a mismatch repair 
polypeptide of a yeast or of a human. 

6. An isolated and purified polypeptide selected from the group consisting of a 
polypeptide encoded by the gene AtMSH3 (SEQ ID NO: 18). a polypeptide encoded by me 

is gene AtMSH6 (SEQ ID NO:30), polypeptides homologous to a polypeptide encoded by the 
gene AtMSH3 (SEQ ID NO: 18) and polypeptides homologous to a polypeptide encoded 
by the gene AtMSH6 (SEQ ID NO:30). 

7. An isolated and purified DNA molecule comprising a polynucleotide sequence 
selected from the group consisting of (i) a sequence encoding a polynucleotide which is 

20 capable of interfering with the expression of a plant polynucleotide sequence encoding a 
polypeptide which is homologous to a mismatch repair polypeptide of a yeast or of a 
human and thereby disabling said plant polynucleotide sequence; and (ii) a sequence 
encoding a polypeptide capable of disrupting the DNA mismatch repair system of a plant. 

8. A DNA molecule according to claim 7 comprising a polynucleotide sequence 
25 encoding a polynucleotide capable of interfering with the expression of a plant 

polynucleotide sequence encoding a polypeptide which is homologous to a mismatch 
repair polypeptide of a yeast or of a human and thereby disabling said plant polynucleotide 
sequence. 

9. A DNA molecule according to claim 8 wherein said polynucleotide is capable 
so of interfering with the expression of a plant polynucleotide sequence is a sense 

polynucleotide, an antisense polynucleotide or a ribozyme. 

10. A DNA molecule according to claim 7 comprising a polynucleotide sequence 
encoding a polypeptide capable of disrupting the DNA mismatch repair system of a plant. 
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11. A DNA molecule according to claim 10 wherein said polypeptide is 
homologous to AtMSH3 (SEQ ID NO: 19) or to AtMSH6 (SEQ ID NO: 31). 

12. A DNA molecule according to claim 10 further comprising a regulation 
element capable of causing overexpression of said polypeptide in a cell of said plant. 

s 13. A chimeric gene comprising: 

a DNA sequence selected from the group consisting of (i) a sequence encoding a 
polynucleotide capable of interfering with the expression of a plant polynucleotide 
sequence encoding a polypeptide which is homologous to a mismatch repair polypeptide of 
a yeast or of a human and thereby disabling said plant polynucleotide sequence, and (ii) a 
io sequence encoding a polypeptide capable of disrupting the DNA mismatch repair system 
of a plant; and 

at least one regulation element capable of functioning in a plant cell. 

14. A chimeric gene according to claim 13 wherein said regulation element is 
selected from constitutive, inducible, tissue type specific and cell type specific promoters. 

is 15. A chimeric gene according to claim 13 comprising a DNA sequence encoding 

a polypeptide capable of disrupting the DNA mismatch repair system of a plant/wherein 
said regulation element is capable of causing overexpression of said polypeptide in a cell 
of said plant, 

16. A chimeric gene according to claim 13 wherein said regulation element is 
20 selected from the group consisting of 35S, NOS, PRla, AoPRl and DMC1. 

17. A plasmid or vector comprising a chimeric gene according to any one of 
claims 13-16. 

18. A plant cell stably transformed, transfected or electroporated with a plasmid or 
vector according to claim 17. 

25 19. A plant comprising a cell according to claim 18. 

20. A plant according to claim 19 selected from plants of the families 
Brassicaceae, Poaceae, Solanaceae, Asteraceae, Malvaceae, Fabaceae. Linaceae, 
Canabinaceae, Dauaceae and Cucurbitaceae. 

21. A process for at least partially inactivating a DNA mismatch repair system of a 
30 plant cell, comprising transforming or transfecting said plant cell with a DNA molecule 

according to any one of claims 1-3 or 7-12 and causing said DNA sequence to express 
said polynucleotide or said polypeptide. 

22. A process for at least partially inactivating a DNA mismatch repair system of a 
plant cell, comprising transforming or transfecting said plant cell with a chimeric gene 
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according to any one of claims 13-16 and causing said DNA sequence to express said 
polynucleotide or said polypeptide. 

23 . A process for at least partially inactivating a DNA mismatch repair system of a 
plant cell, comprising transforming or transfecting said plant cell with a plasmid or vector 

5 according to claim 17 and causing said DNA sequence to express said polynucleotide or 
said polypeptide. 

24. A process for increasing genetic variation in a plant comprising obtaining a 
hybrid plant from a first plant and a second plant, or cells thereof, said first and second 
plants being genetically different; altering the mismatch repair system in said hybrid plant: 

,o permitting said hybrid plant to self-fertilise and produce offspring plants; and screening 
said offspring plants for plants in which homeologous recombination has occurred. 

25 A process according to claim 24 wherein a first gene is incapacitated in said 
first plant, a second gene is incapacitated in said second plant, and said first and second 
genes are incapacitated in said hybrid plant thereby altering the mismatch repair system of 

i s said hybrid plant. 

25. A process according to claim 25 wherein said incapacitation of the mismatch 
repair system of said hybrid plant is reversible. 

26 A process according to claim 24 wherein a new genetic linkage of a desired 
characteristic trait or of a gene which contributes to a desired characteristic trait is 

20 observable in at least one of said offspring plants. 

27 A process for obtaining a plant having a desired characteristic, comprising 
altering the mismatch repair system in a plant, cell or plurality of cells of a plant which 
does not have said desired characteristic, permitting mutations to persist in said cells to 
produce mutated plant cells, deriving plants from said mutated plant cells, and screening 

25 said plants for a plant having said desired characteristic. 

28 A process accordine to claim 27 wherein said step of altering the mismatch 
repair system comprises introducing into said hybrid plant, plant, cell or cells a chimeric 
„ene according to claim 13 and permitting the chimeric gene to express a polynucleotide 
which is capable of interfering with the expression of a plant polynucleotide sequence in a 

ao mismatch repair gene of the hybrid plant, plant, cell or cells, or a polypeptide capable of 
disrupting the DNA mismatch repair system of the hybrid plant, cell or cells. 

29. A process according to claim 28 comprising inactivating an MSH3 gene and/or 
an MSH6 gene of said plant. 

30. A process according to claim 28 comprising inactivating an MSH3 gene and an 
35 MSH6 gene of said plant. 
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31. A process according to claim 27 comprising at least partially inactivating the 
mismatch repair system of said plant in a predetermined cell type or in a predetermined 
tissue of said plant. 

32. A process according to claim 31 further comprising restoring mismatch repair 
5 in said cell type or said tissue. 

33. An oligonucleotide capable of hybridising at 45°C under standard PCR 
conditions to a DNA molecule according to claim 1 with the proviso that said 
oligonucleotide is other than SEQ ID NO:l or SEQ ID NO:2. 

34. An oligonucleotide capable of hybridising at 45°C under standard PCR 
10 conditions to the DNA of SEQ ID NO: 18 with the proviso that said oligonucleotide is 

other than SEQ ID NO.l or SEQ ID NO:2. 

35. An oligonucleotide capable of hybridising at 45°C under standard PCR 
conditions to the DNA of SEQ ID NO: 30 with the proviso that said oligonucleotide is 
other than SEQ ID NO:l or SEQ ID NO:2. 
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TTTTTTGGTTGCT AACAAT AAAGGTAT ACGGTTTTATGTCATCAATATAA 5 0 

CTATATATAAAAGAAATGAAAGATATATATTGTTTTTTCATTTATCAAAC 100 

AAAAC AACAAGACTTTTTTTTT ACTTTTT ACATTGGTCAAC AAAATACAA 150 

GATAAACGACATCGTTTAATCATTTCCCAATTTTACCCCTAAGTTTAACA 200 

CCTAGAACCTTCTCCATCTTCGCAAGCACAGCCTGATTAGGAACAGCTTT 250 

ACCAT7CTCATATTCCTGAACTACCTGAGTCCTCTCATTGATCTGT7TCG 300 

CCAAATCCGCTTGTGACATCTTCTTCTCCAATCTCGCTTTCTGTATCATC 350 

AACCTCACCTCTGCTTTCACACGATCCATCGCCGCAGGCTCTGTTTCTTC 400 

TTCCAGCTTCTTCGTGTTAATCACCGGAACCGCCGTAGATTTCCCCTTTT 4 50 

TGTTCGAACCGGCATCGAATTTCTTAACCGTTTGAACCGCGACACCGTTT 500 

CTCAGAGCTGCGTTAACCGCTTTCGGATCGCGTAGGTCTTGGCTCTTTTG 550 

TTTTGATTTGTGGAGAACTACTGGTTCCCAGTCTTGTGTTACTGCTCCTG 600 

GGTATCTGCTCGGCATCGTCGATGAATTGAGAGAAAGGAACAACGCGAAA 650 

ATTTTATT AATCTGAGTTT7GAAA7TGAGAAACGATGAAGATGAAGAATG 700 

TTGTTGAGAGGATTGTGATATTTATAT AT ACGAAGATTGGTTTCTGGAGA 750 

ATTCGATCA7C7T7TTCTCCA7TTTCG7C7C7GGAACG7TCTTAGAGATG 800 

ATTGACGACGTGTCA7T ATCTGATT7GCAGT7 AACCAATGCTTTTTGGGT 850 * 

7GGATTCG7GGTACACCATATTATCCGA777GGC7CAATGGT77TA7A7A 900 

AATTT GG7T7TCGG77CGG77 ATGAGTT ATC ATTAAAATTAAGCT AACC A 950 . 

AAAATT7TCGTAAAATTT ATTTCGG7TTCAATTCGGATCCCTTACTTCCA 1 000 

GAACCGAATTA77CGAAACCGGGG77A3CCGAACCGAATACCAATGCC7G 1050 

ATTGAC7CG77GGCTAGAAAGA7CCAACGG7ATACAATAATAGAACATAA 1100 

A7CGGACGG7CA7CAAAGCC7CAAAGAG7GAACAGTCAACAAAAAAAG77 1 150 

G AGCCC7G AGGAG7A7CGTT7CCGCCA7T7C7ACGACGCAAGGCGAAAAT 1200 

T7T7GGCGCCAATC7T7CCCCCCT77CGAA7TCTCTCAGCTCAAAACATC 1250 

G777C7C7C7CACTC7C7C7CACAA77CCAAAAAATGCAGCGCCAGAGAT 1300 

CGAT7T7G7C7TTC77CCAAAAACCCACGGCGGCGAC7ACGAAGGG7TTG 1350 

GTTTCCGGCGA7GCTGCTAGCGGCGGGGGCGGCAGCGGAGACCACGATTT 14 00 

AA7G7GAAGGAAGGGGATGCTAAAGGCGACGCTTC7G7ACG77T7GCTGT 1450 

7TCGAAATCTG7CGATGAGGTT AGAGGAACGGATACTCCACCGGAGAAGG 1500 

TTCCGCGTCGTGTCCTGCCGTCTGGATTTAAGCCGGCTGAATCCGCCGGT 1550 

GATGCTTCG7CCCTGTTCTCCAATA7TATGCATAAGTTTGTAAAAGTCGA 1600 

TGATCGAGATTGTTCTGGAGAGAGG7ACTAATCTTCGATTCTCTT AATTT 1*50 

TGTTATCTTTAGCTGGAAGAAGAAGATTCGTGTAATTTGTTGTATTCGTT 1 7 00 

GGAGAGATTCTGATTACTGCATTGGATCGTTGTTTACAAATTTTCAGGAG 1 7 5 0 

CCGAGAAGATGTTGTTCCGCTGAATGATTCATCTCTATGTATGAAGGCTA I 800 

ATGATG7TATTCCTCAATTTCGTTCCAATAATGGTAAAACTCAAGAAAGA 1 8 50 

AACCATGCTTTTAGTTTCAGTGGGAGAGCTGAACTTAGATCAGTAGAAGA 1^00 

TATAGGAGTAGATGGCGATGTTCCTGGTCCAGAAACACCAGGGATGCGTC 1^50 

CACGTGCTTCTCGCTTGAAGCGAGTTCTGGAGGATGAAATGACTTTTAAG 2000 

GAGGATAAGGTTCCTGTATTGGACTCTAACAAAAGGCTGAAAATGCTCCA 2050 

GGATCCGGTTTGTGGAGAGAAGAAAGAAGTAAACGAAGGAACCAAATTTG 2100 

AATGGCTTGAGTCTTCTCGAATCAGGGATGCCAATAGAAGACGTCCTGAT 2150 

GATCCCCTTT ACGATAGAAAGACCTT ACACATACCACCTGATGTTTTCAA 2200 

Figuro 11 
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GAAAA7G7C7GCATCACAAAAGCAA7A77GGAG7G77AAGAGTGAA7ATA 
TGGACATTGTGCTTTTCTTTAAAGTGGTTAGTAACTATTAATCTAGTGTT 
CAATCCATTTCCTCAATGTGATTTGTTCACTTACATCTGTTTACGTTATG 
CTCTTCTCAGGGGAAATTTTATGAGCTGTATGAGCTAGATGCGGAATTAG 
GTCACAAGGAGCTTGACTGGAAGATGACCATGAGTGGTGTGGGAAAATGC 
AGACAGGTAAATTAGTTGAAACAACTGGCCTGCTTGAATTATTGTGTCTA 
TAAATTTTGACACCACCTTTTGTTTCAGGTTGGTATCTCTGAAAGTGGGA 
TAGATGAGGCAGTGCAAAAGCTATTAGCTCGTGGGTAAGGGAACCATCAT 
ACTTTATGGAATTCGTTTACTGCTACTTCGGCTAGGATTTAAGAAATGGA 
AATCACTTCAAGCATCATTAGTTAGGATCCTGAGAACTCAGGATGTTTTC 
TTATTCGTTATATAATAAGTCTTTTCATCAAGGAGTAACAAACAAAACTT 
GCACAATATTTGTGTGCTCACTGGCAAGGCATATATACCCAGCTAACCTT 
TGCTAGTTCACTGTAGTAACAGTTACGGATAATATATGTTTACTTGTATG 
TGGTACCCTCATTTTGTCTCTCATGGAGGCTTTGAAGCCTTGTGTTGAAA 
CTGGATAGTTACATATGCTTCCAACAGAAACTAGCATGCAGATTCATATG 
CTTTCCTAT7C7ACTAA7TATG7A77GACACACTCG7TC7TTCT7TTGAA 
AGATA7AAAG77GGACGAA7CGAGCAGC7AGAAACA7CTGACCAAGCAAA 
AGCCAGAGG7GC7AA7ACTGTAAG777777T3GA7AGGTCAAGGAGAGTG 
TTGCAGAC7G77777GA7CA777C7777777G7ACA77AC777CA7GC7G 
TAATTAACTCAA7GGC7A77CTGG7C7GA7TATCAGATAA7TCCAAGGAA 
GCTAG7TCAGG7A7TAACTCCA7CAACAGCAAGCGAGGGAAACA7CGGGC 
CTGATGCCG7CCATCT7CTTGC7A7AAAAGAGG7T7GT7A7TTAC7TATT 
TATC77A7CA7G77CAG77CATCCAAG7CC7GAAAAA7TACAC7C77CTT 
TACCAA7C77CCA7CAAGC7G7G7AAAGGA777GGAA77AGAAAA7CATT 
ATTTGATGC777G777TATATGCAAGAGGT7CCC7TGAAAAGA7C7GTTT 
AAGA77C777GCAC77GAAAAA7TCAA7C77777AAGTGAA7CCGC7AC7 
TTC77ACAA7GA7CATAG7C7GCAA77GCA7G7CAAG7AA7A7CA77CC7 
TGTTACTGCATCCCCCTCTTTCTTAATGAQCATTG7CTA7GT7G7GTTTC 
TCTCGTGTGCTGGAGAAAATGATAGCTGATCCAAGC7G7ACA77A7CATG 
ATTAAGTAGCTGCTCAGGAATTGCC777GGTTACAT7GCC7AA7GGT7TG 
ATGTCAATTTT7CT7CTGAATCTT7A77TTAGA7CAAAA7GGAGC7ACAA 
AAGTGTTCAACTGTGTATGGATTTGCTTTTGT7GACTGTGC7GCC77GAG 
GTTTTGGGTTGGGTCCATCAGCGATGATGCATCATGTGC7GC7C7TGGAG 
CGTTATTGATGCAGGTAAGCAAG7GTATTCTGTA7CT7ATGTG7ACCATG 
TGACTTCCTG7GCATATATTTGGGT7GCAGGAACTAATTCTGAA7CACCA 
TTTGGTATGTTTTTTCCAGGTTTCTCCAAAGGAAGTGT7ATA7GACAGTA 
AAGGTAAACTGCTTGTA7CGCCAGTTGTTTTGTTAAACAGAA7T7AAGGT 
AAATGACACTGGTTAATTTAAAGTGCATACATGTTGAAATAT7GCAGGGC 
7ATCAAGAGAAGCACAAAAGGCTCTAAGGAAATATACGTTGACAGGTACC 
ATTTCAGTAGGCAAGCTAACTGACAATTTAACCGCTCACCGAATGATAGG 
TCTCTTAAACATTGCTAATGTAGATGATGTTTATGTTTCAATCTAATAGG 
GTCTACGGCGGTACAGTTGGCTCCAGTACCACAAGTAATGGGGGATACAG 
ATGCTGCTGGAGTTAGAAATATAATAGAATCTAACGGATACTTTAAAGGT 
TCTTCTGAATCATGGAACTGTGC7GT7GATGGTCTAAATGAATGTGATGT 

Figure XI (Continued) 
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TGCCC77 AGTGCTC77GGAGAGC7 AA77 AA7CA7C7G7C7 AGGC7AAAGG 4 4 50 

TGTGTTGGCTTGTTTAGTTTTTGCTTTTCACAAATTAAGCAAAGGAACTT 4 500 

TTCATAACTTACAGTTTCTATCTACTTGCAGCTAGAAGATGTACTTAAGC 4 550 

ATGGGGATATTTTTCCATACCAAGTTTACAGGGGTTGTCTCAGAAT7GAT 4 600 

GGCCAGACGATGGTAAATCTTGAGATATTTAACAATAGC7GTGATGGTGG 4 650 

TCCTTCAGGCAAGTGCATATTTCTTTTTTGAT AACTTCAACTAGAGGGCA 4700 

GACATAGAAGGAAAAATTCTAATACTTCGTACGGATCTCCAGTAAGTAAT 4 750 

AGCCGATTTTTGTTTACCTATGTAGGGACCTTGTACAAATATCTTGATAA 4800 
CTGTGTTAGTCCAACTGGTAAGCGACTCTTAAGGAATTGGATCTGCCATC . . 4850 

CACTCAAAGATGTAGAAAGCATCAATAAACGGCTTGATGTAGTTGAAGAA 4 900 

TTCACGGCAAACTCAGAAAGTATGCAAATCACTGGCCAGTATCTCCACAA 4 950 

ACTTCCAGACTTAGAAAGACTGCTCGGACGCATCAAGTCTAGCGTTCGAT 5000 

CATCAGCCTCTGTGTTGCCTGCTCTTCTGGGGAAAAAAGTGCTGAAAC AA 5050 

CGAGTAAGTATCAATCACAAGTTTTCTGAGTAATGCCTTCCATGAGTAGT 5100 

ATAGGAC7 AAAAC A77ACGGG7C7AGC7 AAAGAC7G77C7CC77C7777G 5150 

CAATG7CTGGTTATTCATTACATTTCTCTTAACTTATTGCATTGCAGGTT 5200 

AAAGCA7TTGGGCAAATTGTGAAAGGGTTCAGAAGTGGAATTGATCTGTT 5250 * 

G77GGC7C7ACAGAAGGAA7CAAATA7GATGAG77TGCT77ATAAACTCT 5300 

G7 AAAC7 7 CC7 AT A77 AG7 AGG AAAAAGC GGGCT AG AG 7 7 A777 C7 77 CT 5350 

CAA77CGAAGCAGCCATAGA7AGCGAC777CCAAAT7A7CAGG7GCCCAT 54 00 

CTATCT77CA7AC777ACAACAAAA7G7C7G7CACTACTCAAAGCAATGC 54 50 

A7A7GGC77AGA7C7CAAC7CACACCCCGAGGA7CC7AAAGGGA777GC7 5500 

7777 A77CC7AA7G77777GGA7GG777GA777A777C7 AAC77GAAC7T 5550 

AT7AA7C77G7ACCAGAACCAAGA7G7GACAGA7GAAAACGC7GAAAC7C 5600 

TCACAA7ACT7ATCGAAC77TTTATCGAAAGAGCAAC7CAATGG7C7GAG 5 650 

GTCA77CACACCA7AAGC7GCC7AGA7G7CC7GAGA7C7777GCAA7CGC 57 00 

AGCAAG7C7C7C7GC7GGAAGCA7GGCCAGGCC7G77A77777CCCGAA7 57 50 

CAGAAGC7 ACAGA7CAGAA7CAGAAAACAAAAGGGCCAATAC77AAAA7C 5800 

CAAGGAC7A7GGCA7CCA777GCAG77GCAGCCGA7GG7CAA77GCC7G7 5850 

7CCGAATGATA7AC7CCTTGGCGAGGC7AGAAGAAGCAG7GGCAGCATTC 5900 

ATCC7CGGTCATTG77ACTGACGGGACCAAACATGGGCGGAAAA7CAAC7 5950 

C77C77CGTGCAACATGTCTGGCCGT7A7C777GCCCAAGT77G7A7ACT 6000 

CG77AGA7AA77AC7C7A77C777GCAA7CAG77C77CAACA7GAA7AA7 6050 

AAAT7CTGTTTTCTGTCTGCAGCTTGGC7GCTACGTGCCG7G7GAG7CTT 6100 
GCGAAATCTCCCTCGTGGATACTATCT7CACAAGGC77GGCGCA7CTGAT 6150 
AGAA7CA7GACAGGAGAGAGTAAGTT7TG7TCTCAAAA7ACCAAT7CCTC ^200 
GAAC7A77TACTCAGAT7T7GTC7GAT7GGACAAGG7GG7777GC7TTTT " 50 
7T7AGG7ACCTTTT7GGTAGAATGCAC7GAGACAGCG7CAG7TC7TCAGA 6300 
A7GCAAC7CAGGATTCAC7 AG7AA7CCTTGACGAAC7GGGCAGAGGAACT 6350 
AG7ACTTTCGATGGA7ACGCCAT7GCA7ACTCGG7AACC7GCTCTTCTCC ^400 
77CAACT7 AT ACTTG77GATCAACAAAAACA7GCAA77CA7777GC7GAA 6450 
AC7TATTGATTTATATCAGGTTTTTCGTCACCTGGTAGAGAAAGTTCAAT 6500 
GTCGGATGCTCTTTGCAACACATTACCACCCTCTCACCAAGGAATTCGCG 6550 
TC7CACCCACGTGTCACCTCGAAACACATGGCTTGCGCAT7CAAA7CAAG 6 600 

Figure 11 (Continued) 
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ATCTGATTATCAACCACGTGGTTGTGATCAAGACCTAGTGTTCTTGTACC 6650 

GTTT AACCGAGGG AGCTTGTCCTGAGAGCT ACGG ACTTCAAGTGGCACTC 6700 

ATGGCTGG AATACCAAACCAAGTGGTTGAAAC AGCATCAGGTGCTGCTCA 67 50 

AGCCATGAAGAGATCAATTGGGGAAAACTTCAAGTCAAGTGAGCTAAGAT 6300 

CTGAGTTCTCAAGTCTGCATGAAGACTGGCTCAAGTCATTGGTGGGTATT 6850 

TCTCGAGTCGCCCACAACAATGCCCCCATTGGCGAAGATGACTACGACAC 6900 

TTTGTTTTGCTTATGGCATGAGATCAAATCCTCTTACTGTGTTCCCAAAT 6950 

AAATGGCT ATGACATAACACTATCTGAAGCTCGTTAAGTCTTTTGCTTCT 7000 

CTGATGTTT ATTCCTCTTAAAAAATGCTTATATATCAAAAAATTGTTTCC 7050 

TCGATTATAACAAGATTATATATGTATCTGTCGGTTTAGCTATGGTATAT 7100 

AATAT ATGT ATGTTCATGAGATTGGTCAAGAGAAATACTCACAAACAGTA 7150 

TATTAAGAAGGAAATATGTTTATGCATTAATTTAAGTTTCAAGATAAACT 7200 

GCAAAT AACCTCGACTAAAGTTGCAAAG ACCAAACACAAATTACAAAACT 7 250 

T AT AAG AC T T AAGT T CT G AATTCC CT AAAAC C AAAAAAAAAAAC AG AACA 7 300 

TATTTTGTTGCATCTACAAACAACACAAACCTACATAGTTTATAACTTAC 7 350 

TCATCACTGAGATTAACATCAGAATCATTCTCCATTTCTTCATCTTCACT 7 4 00 w 

CTCATCATCATCACCACCACCATGATGATTCTCCTCCTCTTCACGTAACC 7 4 50 

TAGCAATCTCACTCTGAGCTCTATCAACAATCTGCTTCTTCTGCAACTCC 2 500 

AAATCTCTCTG AAAATCAGCTCTCATCTTCTCCAACTCCTTCATTTGCTC 7 550 

TTTCTTACTCTTCTCCATCTTCTCATAAACCTTCCCAAACCTCTCAACAG ,J600 

AATCCGCCAACATCTTATACGAAGCAGCGTCATTAACCTTCTTCCTCTCG 7 650 

TACTCAACCTCATCATCCTCATCCTCCTCCTCTTCAGAATCACCAGGACT ,7700 

ATCCATCATCTCATCAAACCCATTAGACTT ATCT AAATAAACC7T AGTGT 77 50 

TCAT AAACACAAACTCACCTGAATCAAC ACCACAAGCTAAACCTAAATCC 7 900 

GACTTGGGCGAAACACAAAGCAACATATCCAACTTATTGAAAAACGACCA J 8 50 

TTTACTTGAACCTAAACCTGATTTC7CAACCTTAATCTTCTCTTTTCTAT 7 900 

ACTTCCTCTTCAAGTCATCAATCATTCTCCTACATTGCGTCTCAGATTTC 7 950 

TCCATCCTTAGCTCCTCACTCACTTTCTCAGCTACTTCATTCCAATCCTC 8000 

GTTCCTCAAACTCCTTCTACCCAATTGCAAAAACCTAtCTCCCCAAACTT 8050 

C AAGC AACACAA 8062 

Figure 11 (Continued) 
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Figure 12 



Comments/Roftrtnces: AtMSH3 3' side antisense : AtMSH3 3* (13 = 2104bp) from 
pUCl8/13 Sal1/Sst1/T4 into pCW164 BamHl/T4 in Agrobacterium LBA4404 
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Figure 13 



Comments/References: AtMSH6 (S8) 3 f side antisens : 62 Sal1/Sst1/T4 (1379bp) 
into pCW!64 8amH1/T4 
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8210 EcoRl.Sad.Kpnl 



Figure 14 



Comments/References: AtMSH6 3VAtMSH3 3' antisense : AtMSH6 (S8) 3' side (62=1379bp) 
Sal1/Sst1/T4 into pPF13 ( P CW164 AtMSHS (S5) 3' side (13=2104) antisens)/Sma1 in 
LBA4404 
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8210 EcoRl.Sad. Kpn1 



Figure 15 



Comments/References: AtMSH3 37AtMSH6 3* antisens (D) : AtMSH3 (SS) 3* side 
(13=2104bp) Sal1/Sst1/T4 into pPFl4 (AtMSH6 (S8) 3'side (62=1379bp) antisense into 
pCW164)/Smal. in LBA4404 
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BamH1 10554, 




8210 EcoRl.Sacl. Kpnl 



Figure 16 



Comments/References: AtMSH3 (S8) complete, sense orientation : pPF26 (3342bp) 
Sma1 into pCW164 Sma1 
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Figure 17 



CommoAts/Asltrtneii: pPZP111 wilh codA EcoRl catsette in EcoRl site and Hlnd3 
GUS cassette in Hind3 site. Kan ft AO genes under Promoter/terminator 35 S 
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Figure 18 
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drive 



pNos-Hyg-SCV 10.2 kb 



Gm/Km 



RK2 
ori 
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Sacl 






II 


Smai 

I 








DMC1 promoter 1.8 kb 


t-NOS 
0.25 kb 









Eco Rl 



Eco Rl 



Amp 

ColE1 
ori 

V 



Eco RV ^ Pst I earn HI 



Eco Rl 
Smal 

/ - 



LB pNOS 

0.35 kb 



Hyg M t-NOS 

1.1 kb 0.25 kb 
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RK2 
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SEQUENCE LISTING 



<110> 



<220> 
<223> 



yssinet| Georges; Perez, Pascual. 



<X20> Methods for obtaining plant varieties 

<130> 395498C 

<150> P09745 
<151 > 1997-10-10 

<160> 99 

<210> i 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<221> modified base 

<222> ii 

<223> i 
<220> 

< 2 2 1 > modi f ied_base 
<222> 14 

<223> r 

<220> 

< 2 2 1 > modi f ied_base 
<222> 17 

<223> t 



ACMSH6" Ce ° li » OBUcl -<*«" 0-PMU used to isolate AtMSH3 and 



<300> 

<30i> Reenan and Kolodner 

<302> Genetics 

<303> 132 

<306> 963-973 

<307> 1992 

<400> i 

ctggatccac nggnccnaay atg 

<210> 2 

<211> 23 

<212> DNA 



23 
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<213> Arcificial sequence 
<220> 

<22l> modif ied_base 

<222> 15 

<223> I 

<220> 

< 2 2 1 > modi f ied_base 

<222> 18 

<223> I 

<220> 

<223> Degenerate oligonucleotides DOMU used to isolate AtMSH3 and 
AtMSH6 . 

<300> 

<301> Reenan and Kolodner 

<302> Genetics 

<303> 132 

<306> 963-973 * 

<307> 1992 

<400> 2 

ctggatccrt artgngtnrc raa 23 



<210> 3 

<211> 24 

<212> DNA x 

<213> Artificial sequence 

<220> 

<223> MSH3 specific primer 636 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 3 

tgctagtgcc tcttgcaagc teat 24 



<210> 4 
<211> 27 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer API for PCR using cDNA of Arabidopsis thaliana ecotype 

Columbia containing adapter sequences ligated to both its 
ends 

<400> 4 
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ccaccctaat acgactcact atagggc 

<210> 5 

<211> 23 

<212> DNA 

<213> Artificial sequence 



27 



<220> 
<223> 



Primer AP2 for PCR using cDNA of Arabidopsis thaliana ecotype 
Columbia containing adapter sequences ligaced to both its 
ends 



<400> 5 
actcactata gggctcgagc ggc 



<210> 6 

<211> 30 

<212> DNA 

<213> Artificial sequence 



23 



<220> 
<223> 



MSH3 specific primer S525 for PCR using cDNA of Arabidopsii 
thaliana ecotype Columbia 



<400> 6 

aggttctgat tratgtgtgac gctttactta 

<210> 7 

<211> 29 

<212> DMA 

<213> Artificial sequence 



30 



<220> 
<223> 



MSH3 specific primer S51 for PCR using cDNA of Arabidopsis 
thaliana ecocype Columbia 



<400> 7 

ggatcgggta ctgggttttg agtgtgagg 29 

<210> 8 

<211> 24 

<212> DNA 

<213> Artificial sequence 



<220> 
<223> 



MSH3 specific primer 635 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 



JSDOCIO <WO 9919492A2 I > 



WO 99/19492 



PCT/EP98/06977 



<400> 8 

gcacgtgcct gatggtgttc ccac 24 

<210> 9 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S523 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 9 

tcagacagta tccagcatgg cagaagta 28 

<210> 10 * 

< 2 1 1 > 3 3 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer 1S5 for PCR using cDNA of Arabidopsis 
thaliana ecocype Columbia 

<400> 10 

atcccgggac gggcaagcaa aagcagcaga cga 3 3 

<210> 11 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S53 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 11 

gacaaagagc gaaatgaggc cccttgg 27 



<210> 12 

<211> 1250 

<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 

<223> Clone 52 



WSDOCID: <WO 9919492A2 1 > 
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<400> 12 

cccgggatgg gcaagcaaaa gcagcagacg acctctcgtt tcttcgctcc caaacccaaa 60 

tccccgaccc acgaaccgaa tccggcagcc gaaccatcaa caccgccacc gaagacatcc 120 

gccaccgcat ccccccctcc tcccaagcgc aagcctcccc ccgaccacct cgccgccgcg 180 

tcacccaaaa agcccaaacc ttctcctcac actcaaaacc cagtacccga ccccaaccca 240 

caccaaagat ctccccagag atctccggaa ccctcgccgg aggaacacgc ccccgaaacg 300 

ccaccatcga ggaaacacac accatcggaa cagcaagtgg cggagccaaa gagcaagcac 3 60 

ccagatgtgg tttcgacggc ggaagttggt tacaggcaca gactccccgg agaagacgcg 420 

gagatcgcag cacgcgcgcc gggcacccac gcccacatgg accacaattc cacgacggcg 480 

agcgcgccaa cattccgatt gaatctccac gcgagaagac cggcgaacgc aggacacaag 540 

m 

actggcgcag -gaagcagac cgaaaccgca gccaccaagc cccacggcgc aaaccggacc 600 

ggccctc::: "ccggggacc gccggcgccg cataccaaag ccacgcctga agcggccgag 660 

gacacaagcg gcggtcgcgg tggtgaagaa ggctctggcc cacagagtaa ccccccggct 720 

tgcgccgcgg atgagagagt taagccggag acaccaggcc gcggcaccga aacgagccct 780 

gacgccagag ccggcgccgc tggcgccgaa atctcgacag gtgaagccgt tcacgaagag 840 

ttcaatgaca acctcacgag aagcggacca gaggctgtga ccttgagctt gtcaccagct 900 

gagccgccgc tcggccagcc tctttcacaa caaaccgaga agtcctcggt ggcacacgct 960 

ggacctacct caaacgctcg agcggaacgc gccccaccgg accgccccag caacggcaac 1020 

gcagcagacg aggccacccc accacgcgaa aaaaccagcg caggcaaccc agaagacgac 1080 

aaagaaacga agccggaggc tgccgaaaaa ggaacgcccc gcccgacagc tcacacaacc 1140 

acgaacatgc cacacccgac cgtccaagcc cccgccccaa cgccccgcca ccccaaacag 12 00 

cccggattcg aaaggatccc ccaccaaggg gcctcacccc gccccccgcc 12 50 

<210> 13 
<211> 34 
<212> DMA 

<213> Arcificial sequence 

<220> 

<223> MSH3 specific primer 2S5 for PCR using cD&A of Arabidopsis 

thaliana ecocype Columbia 
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<400> 13 

accccgggcc aaaacgaaca agccggcttt agtc 34 

<210> 14 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S52 for PCR using cDNA of Arabidopsis 
chaliana ecocype Columbia 

<400> 14 

gccacatctg actgttcaag ccctcgc 27 



<210> 
<211> 
<212> 
<213> 
<223 > 


1 5 

2110 
DNA 

Arabi dcps 
Clone 13 


is zhai iana 


ecocype Colurrisia 


■r 


m 


< 4 00 > 


15 








• 'i - 




gccacatctg 


actgttcaag 


ccctcgccct 


aacgttttgc 


catctcaaac 


agcttggatt 


60 


cgaaaggatc 


ccctaccaag 


gggect cat t 


tegctctttg 


tcaagtaaca 


cagagatgac 


120 


ccccccagcc 


aacaccctgc 


aacagtegga 


ggttgtgaaa 


aataattcag 


atggatcgga 


180 


atctggcccc 


teat cccaca 


atatgaacca 


cacacttaca 


gtatatggtt 


ccaggcttct 


240 


cagacaccgg 


gtgactcatc 


ctctacgcga 


tagaaatttg 


atacctgctc 


ggcttgatgc 


300 


tgtttctgag 


atttctgett 


gcatgggatc 


tcatagttct 


tcccagctca 


gcagtgagtt 


360 


ggttgaagaa 


ggttctgaga 


gagcaatcgt 


atcacctgag 


ttttatctcg 


tgctctcctc 


420 


agtcctgaca 


gctatgtcta 


gatcatctga 


tattcaacgt 


ggaataacaa 


gaatctttca 


480 


tcggactgct 


aaagccacag 


agttcatcgc 


agttatggaa 


gctattttac 


ttgcggggaa 


540 


gcaaactcag 


eggcttggea 


taaagcaaga 


ctctgaaatg 


aggagtatgc 


aatctgeaac 


600 


tgcgcgatct 


actctcttga 


gaaaattgat 


ttctgttatt 


ccatcccctg 


ttgtggttga 


660 


caacgccgga 


aaacttctct 


ctgccctaaa 


taaggaagcg 


geegttcgag 


gtgacttget 


720 


cgacatacta 


atcacctcca 


gcgaccaatt 


tcctgagctt 


gctgaagctc 


gecaagcagt 


780 


tttagtcatc 


agggaaaagc 


tggattcctc 


gatagcttca 


cctcgcaaga 


agcccgctat 


840 
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ccgaaac c eg 


Qaacctcttc 

^7 w w w 


ddyt.yt.cyyg 


gat cacacac 


t tgatagagc 


tgcccgttga 


900 


c cccaaggcc 


cccacgaat c 


yyy i-yaady t 


aaacagcacc 


aagaagacta 


ttcgatatca 


960 


ccccccagaa 


acagcagctg 


rrr r t ci era r a a 


yccagcccta 


gcaaccgaac 


accctgccac 


1020 


tgtgaaccga 


gcttCQtago 


a t acr trrrrr 


Laayay C C CC 


agtagacact 


acacagatct 


1080 


caaggctgcc 


gcccaagccc 


1 1 crc t cr c a c* r 


yyacLyLLLy 


to j_ «a to to to 

caccccctct 


caactctatc 


1140 


tagaaacaag 


aactatatcc 


crt ccc^craGrt t 

y wv»w^»y»y l L 


tgeggacgae 


tgcgaaccag 


ttgagataaa 


1200 


catacagtct 


cjcrt cert cat c 

v» w w u v— v« 


w >— y ucil tyya 


gac t acacca 


caagacaact 


tcgtcccaaa 


1260 


cgacacaac t 


v» l. y ^— C& V— y v- d y 


ddyy y y aawa 


c tgecaaac t 


atcaccggac 


ctaacatggg 


1320 


acrcraaacjacrc 




y uuaay wtyc 


tototoa?atototo#^^w 

l "caacctcc 


ataatggctc 


aggttggttc 


1380 


ctttcrtacca 


y <-y LLaLLty 


ccaagccgca 


cgtgcttgat 


ggtgtcttca 


ctcggatggg 


1440 


r cr f t~ r raaA r 
i»y v« i— d y ci v_ 


»y LaL L Lay C 


atggcagaag 


cacctc tcta 


gaagaattaa 


gtgaagcgtc 


1500 


acacacaatc 


a^aau<k. L y L. L 


/^» f~ /— • f mm 

LLLL.LCjl.Lw 


get tgt tana 


t tagatgagc 


ttggaagagg 


1560 


cactaQcaca 

c* w v» ay v.a^a 




l ageca wCgc 


etatgeaaca 


t tacagcacc 


tcctagcaga 


1620 


aaacraaacat 


r r rrrr r* rrfrr 


u LylCaCyCa 


t taccctgaa 


atagctgaga 


teagtaaegg 


1680 


a t t c c caaa t 


>— l ty u Ly y y d 


LaLaCCaLyC 


S-* ^ ^Mto A torn 

ct cgtatccg 


acattgeaga 


aggataaagg 


1740 


cacrt t ataa t 


La wyauyciuy 


LyaCCLaCCC 


atataagct t 


gtgcgtggtc 


tttgcagcag 


1800 


cracrct t toat 


1 1 1 a a crcr 1~ f n 


L L Lay L v. Ly C 


ccagatacc t 


ccatcatgta 


taegtcgage 


1860 


U U L L L ^ — C* SUM 


y v- LyLdaaa L 


eggaagcuga 


ggtacgtgca 


agagagagaa 


acacacgcat 


1920 


gggagaacca 


gaaggacacg 


aagaaccgag 


aggegcagaa 


gaatctactt 


cggccctagg 


1980 


tgacttgttt 


gcagacctga 


aatttgetet 


ctccgaagag 


gacccttgga 


aagcattcga 


2040 


gtttttaaag 


catgcttgga 


agattgeegg 


caaaatcaga 


ctaaaaccaa 


ctegttcatt 


2100 


tcgacccggg 












2110 


<210> 
<211> 
<212> 


16 
29 
DNA 













<213> Artificial sequence 

<220> 



<223> MSH3 specific primer S51 for PCR using cDNA of Arabidopsis 

thaliana ecotype Columbia 



<400> 16 
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ggaccgggca ctgggt t ttg agtgtgagg 2 9 



<210> 17 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S525 for PCR using cDNA of Arabidopsis 
thaliana ecocype Columbia 

<400> 17 

aggttctgat tatgtgtgac gctttactta 30 



<210> 18 

<211> 3522 

< 2 1 2 > DNA 

<213> Arabidopsis ctaiiana ecocype Columbia * 
<220> 

<221> CDS 

<222> (100) .... (3342) 

<223> ACMSH3 full -length cDNA and deduced sequence of the encoded 
polypeptide 

<400> 18 

cccaagaaag cgcgcgaaaa ttggcaaccc aagttcgcca tagccacgac cacgaccttc 6 0 

catttctcrt aaacggagga gattacgaat aaagcaatt 99 

atg ggc aag caa aag cag cag acg att tct cgt ttc ttc get ccc aaa 14 7 

Met Gly Lys Gin Lys Gin Gin Thr lie Ser Arg Phe Phe Ala Pro Lys 
1 5 10 15 

ccc aaa tec ccg act cac gaa ccg aat ccg gta gec gaa tea tea aca 195 
Pro Lys Ser Pro Thr His Glu Pro Asn Pro Val Ala Glu Ser Ser Thr 
20 25 30 

ccg cca ccg aag ata tec gee act gta tec ttc tct cct tec aag cgt 243 
Pro Pro Pro Lys lie Ser Ala Thr Val Ser Phe Ser Pro Ser Lys Arg 
35 .40 45 

aag ctt etc tec gac cac etc gee gee gcg tea ccc aaa aag cct aaa 291 
Lys Leu Leu Ser Asp His Leu Ala Ala Ala Ser Pro Lys Lys Pro Lys 
50 55 60 

ctt tct cct cac act caa aac cca gta ccc gat ccc aat tta cac caa 33 9 

Leu Ser Pro His Thr Gin Asn Pro Val Pro Asp Pro Asn Leu His Gin 
65 70 75 80 
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aga ccc ccc cag aga tec ctg gaa ccc ccg ccg gag gaa cac gee ccc 
Arg Phe Leu Gin Arg Phe Leu Giu Pro Ser Pro Glu Glu Tyr Val Pro 

85 90 95 

gaa acg cca cca teg agg aaa tac aca cca ttg gaa cag caa gcq aca 
Glu Thr Ser Ser Ser Arg Lys Tyr Thr Pro Leu Glu Gin Gin Val Val 
100 105 no 

gag eta aag age aag tac cca gat gtg gtt ttg atg gtg gaa get ggt 
Glu Leu Lys Ser Lys Tyr Pro Asp Val Val Leu Met Val Glu Val Gly 
115 120 12 5 



ttg ggt att tac get cat atg gat cac aat ttc atg acg gcg agt gtg 
Leu Gly lie Tyr Ala His Met Asp His Asn Phe Met Thr Ala Ser Val 
145 150 



155 iso 



agt ttt gat gtt aga gtc ggt get gtt ggc gtt gaa att teg aca ggt 
Ser Phe Asp Val Arg Val Gly Val Val Gly Val Glu He Ser Thr Gly 
260 265 270 



387 



435 



483 



tac agg tac aga ttc ttc gga gaa gac gcg gag ate gca gca cgc gtg 531 
Tyr Arg Tyr Arg Phe Phe Gly Glu Asp Ala Glu He Ala Ala Arg Val 
130 135 140 



579 



627 



cca aca ttt cga ttg aat ttc cat gtg aga aga ctg gtg aat gca gga 
Pro Thr Phe Arg Leu Asn Phe His Val Arg Arg Leu Val Asn Ala Gly 
-65 17C 175 

tac aag act ggt gta gcg aag cag act gaa act gca gee att aag ccc 675 
Tyr Lys He Gly Val Val Lys Gin Thr Glu Thr Ala Ala lie Lys Ser 
180 185 190 

cat ggt gca aac egg acc ggc cct ttt ttc egg gga ccg ccg gcg ccg 723 
His Gly Ala Asn Arg Thr Gly Pro Phe Phe Arg Gly Leu Ser Ala Leu 
195 200 ' 205 

cac acc aaa gee acg ccc gaa gcg gec gag gac aca age ggc ggt tgt 77 1 

Tyr Thr Lys Ala Thr Leu Glu Ala Ala Glu Asp lie Ser Gly Gly Cys 
210 215 " 220 

ggt ggt gaa gaa ggt ttt ggt tea cag agt aac ccc ccg get tgt gtt 819 
Gly Gly Glu Glu Gly Phe Gly Ser Gin Ser Asn Phe Leu Val Cys Val 
225 230 235 240 

gtg gat gag aga gtt aag teg gag aca tta ggc tgt ggt att gaa atg 867 
Val Asp Glu Arg Val Lys Ser Glu Thr Leu Gly Cys Gly He Glu Met 
245 250 255 



915 



gaa gtt gtt tat gaa gag ttc aac gat aat ttc acg aga age gga CCa 963 
Glu Val Val Tyr Glu Glu Phe Asn Asp Asn Phe Met Arg Ser Gly Leu 
275 280 285 
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gag gcc gcg att ctg age teg tea cca gcc gag ccg ccg ctt ggc cag 1011 
Glu Ala Val He Leu Ser Leu Ser Pro Ala Glu Leu Leu Leu Gly Gin 
290 295 300 

cc: cc: cca caa caa ac: gag aag tec ccg gcg gca cat gcc gga ccc 1059 
Pro Leu Ser Gin Gin Thr Glu Lys Phe Leu Val Ala Met Ala Gly Pro 
305 310 315 320 

acc cca aac gtc cga gcg gaa cgt gcc cca ctg gac cgc ttc age aat 1107 
Thr Ser Asn Val Arg Val Glu Arg Ala Ser Leu Asp Cys Phe Ser Asn 
325 330 335 

ggc aac gca gca gac gag get act tea tta tgt gaa aaa ate age gca 1155 
Gly Asn Ala Val Asp Glu Val He Ser Leu Cys Glu Lys He Ser Ala 
340 345 350 

ggt aac tea gaa gat gat aaa gaa atg aag ctg gag get get gaa aaa 1203 
Gly Asn Leu Glu Asp Asp Lys Glu Met Lys Leu Glu Ala Ala Glu Lys 
355 360 365 

gga acg tec cgc ccg aca gcc cat aca acc acg aac acg cca cat ctg 125? 
Gly Met Ser Cys Leu Thr Val His Thr He Met Asn Met Pro His Leu 
370 375 380 

acc gcc caa gcc etc gcc cca acg ccc cgc cac c:c aaa cag ttt gga 1299 
Thr Val Gin Ala Leu Ala Leu Thr Phe Cys His Leu Lys Gin Phe Gly 
385 390 395 400 

ctt gaa agg ate ccc cac caa ggg gcc tea tct cgc cct ctg tea agt 1347 
Phe Glu Arg lie Leu Tyr Gin Gly Ala Ser Phe Arg Ser Leu Ser Ser 
405 410 415 

aac aca gag atg act ccc tea gcc aac acc ccg caa cag ccg gag gcc 1395 
Asn Thr Glu Mec Thr Leu Ser Ala Asn Thr Leu Gin Gin Leu Glu Val 
420 425 430 

gcg aaa aat aat tea gat gga ccg gaa tct ggc ccc tea ccc cat aat 144 3 

Val Lys Asn Asn Ser Asp Gly Ser Glu Ser Gly Ser Leu Phe His Asn 
435 440 445 

acg aac cac aca ctt aca gca tae get tec agg ccc ctt aga cac tgg 1491 
Met Asn His Thr Leu Thr Val Tyr Gly Ser Arg Leu Leu Arg His Trp 
450 455 460 

gtg act cat cct eta tgc gat aga aat ttg ata tct get egg etc gat 153 9 

Val Thr His Pro Leu Cys Asp Arg Asn Leu He Ser Ala Arg Leu Asp 
465 470 475 480 

get gtt tct gag ate tct get tgc atg gga tct cat agt tec tec cag 158*7 
Ala Val Ser Glu He Ser Ala Cys Met Gly Ser His Ser Ser Ser Gin 
485 490 495 
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1 1 



- S = K 2 E E E E E S 25 E E E E 

505 S1Q 



CCC gag CCt Cat CtC qCCT err rr-r- 

Pro Olu P„e T yr Leu "J E s« S« E Leu E E ^ "* *** 
515 1 Leu Thr Mec Ser Arg 

= £ E E S E 5 E E 5 E E = S ~ K 
£ E S E E S E E S E E E E E S E 



S5S S60 



E e e e e °e" e e e e r e 9aa ac9 ■» -« 

56 i y LyS Gln As P Ser Olu Met Arg Ser 



570 575 



585 590 



"It Til se" se" ^ £J E? f M W * *** C " « C tct 

595 -° Val Val jal Asp Asn Ala Giy Lys Leu Leu Ser 

600 60S 



lit Leu Tsn LyI S" S Ma ?* *f " C «■ 

610 J, 1 ? Val Arg As P Leu Asp H e Leu 

515 620 



S S E E E S E E E S E E E E S E 

" 0 635 640 

ss ss ^ g *** « tcc tcg aca g CC cca eet cgc 

lie Arg Glu Lys Leu Asp Ser Ser He Ala Ser Phe Arg 

6S0 655 

aag aag etc gec act cga aat tco oaa rr* 

Lys Lya " u e iia ~ 9 - £ s: E E E E E E E 



665 670 



S 2 2 S S S E E E S E S E S E 5 



680 685 



52 E E E E E E 2 E E E E ca < c " — — 

690 yS Thr Ile T ^ r His Pro Pro Glu 

695 700 



1635 



1683 



1731 



1779 



1827 



E E E E "r E E E E E E E aaa " 9 «« «« »• 

58 0 loc U ^ LyS LeU Ile S « 



1923 



1971 



2019 



2067 



2115 



2163 



2211 
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aca gca get 
lie Val Ala 
705 

acc gcg aac 
lie Val Asn 



cac tac aca 
Tyr Tyr Thr 



tgc ccg cac 
Cys Leu His 
755 

ccc gag etc 
Pro Glu Phe 
770 

ggt cgt cat 
Gly Arg His 
785 

aac gac aca 
Asn Asp Thr 



gga cct aac 
Gly Pro Asn 



act tec ata 
lie Ser He 
835 

aag ctg cac 
Lys Leu His 
850 

agt ate cag 
Ser He Gin 
865 

tea cac ata 
Ser His He 



gag ctt gga 
Glu Leu Gly 



ggc ttg gat 
Gly Leu Asp 
710 

cga get ccg 
Arg Ala Ser 
725 

gac ttc aag 
Asp Phe Lys 
740 

ccc ccc tea 
Ser Leu Ser 



gcg gat gac 
Val Asp Asp 



ccc gca ccg 
Pro Val Leu 

790 

acc teg cac 
lie Leu His 
805 

acg gga gga 
Met Gly Gly 
820 

acg gec cag 
Mec Ala Gin 



gcg ccc gac 
Val Leu Asp 



cac ggc aga 
His Gly Arg 
870 

acc aga acc 
He Arg Thr 
885 

aga ggc act 
Arg Gly Thr 
900 



gag eta gec 
Glu Leu Ala 



egg gac age 
Trp Asp Ser 



gec gec gec 
Ala Ala Val 
745 

acc cca ccc 
Thr Leu Ser 

760 

cge gaa cca 
Cys Glu Pro 
775 

gag ace aea 
Glu Thr He 



gca gaa ggg 
Ala Glu Gly 



aag age egc 
Lys Ser Cys 
825 

gec ggc ccc 
Val Gly Ser 
840 

ggc gec ccc 
Gly Val Phe 
855 

age acc ccc 
Ser Thr Phe 



cge ccc cct 
Cys Ser Ser 



age aca cac 
Ser Thr His 
905 



12 

cca gca ace 
Leu Ala Thr 
715 

etc ccc aag 
Phe Leu Lys 
730 

caa gee cee 
Gin Ala Leu 



aga aac aag 
Arg Asn Lys 



gee gag aca 
Val Glu He 
780 

tea caa gae 
Leu Gin Asp 
795 

gaa cac cge 
Glu Tyr Cys 
810 

cac acc cge 
Tyr lie Arg 



etc gca cca 
Phe Val Pro 



acc egg acg 
Thr Arg Met 
860 

cca gaa gaa 
Leu Glu Glu 
875 

cge teg ccc 
Arg Ser Leu 
890 

gac ggc gca 
Asp Gly Val 



gaa cat ccc 
Glu His Leu 



age ttc agt 
Ser Phe Ser 
735 

gee gca ctg 
Ala Ala Leu 
750 

aac tac gtc 
Asn Tyr Val 
765 

aac ata cag 
Asn He Gin 



aac ttc gec 
Asn Phe Val 



caa ate acc 
Gin He He 
815 

caa gec gec 
Gin Val Ala 
830 

gcg cca ccc 
Ala Ser Phe 
845 

ggc gec cca 
Gly Ala Ser 



cca agt gaa 
Leu Ser Glu 



gtt ata tea 
Val He Leu 
895 

gec ate gec 
Ala He Ala 
910 



gee 2259 

Ala 

720 

aga 2 3 07 

Arg 



gac 2355 

Asp 



cgt 2403 
Arg 



tct 2451 
Ser 



cca 249^ 

Pro 

800 

acc 2 54 7 

Thr 

tea 2595 
Leu 



gee 2643 
Ala 



gac 2691 
Asp 



gcg 2739 

Ala 

880 

gac 2787 
Asp 



tat 2835 
Tyr 
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gca aca tea cag can etc eta gca gaa aag aga tgt ttg get ccc ttt 2883 
Ala Thr Leu Gin His Leu Leu Ala Glu Lys Arg Cys Leu Val Leu Phe 
915 920 4 925 

gee acg cat cac ccc gaa aca get gag ate age aac gga tec cca ggt 2931 
Val Thr His Tyr Pro Glu He Ala Glu He Ser Asn Gly Phe Pro Gly 
930 935 940 

tec get ggg aca tac cat gtc teg tat ctg aca ttg cag aag gat aaa 2979 
Ser Val Gly Thr Tyr His Val Ser Tyr Leu Thr Leu Gin Lys Asp Lys 
945 950 955 960 

ggc age tat gat cat gat gat gtg acc tac eta tat aag ctt gtg cgt 3027 
Gly Ser Tyr Asp His Asp Asp Val Thr Tyr Leu Tyr Lys Leu Val Arg 
965 970 * 975 

ggt ctt tgc age agg age ttt ggt ttt aag gtt get cag ctt gec cag 3075 
Gly Leu Cys Ser Arg Ser Phe Gly Phe Lys Val Ala Gin Leu Ala Gin 
980 985 990 

ata cct cca tea cgt ata cgt cga gec act tea atg get gca aaa ctg 312? 
lie Pro Pro Ser Cys lie Arg Arg Ala lie Ser Met Ala Ala Lvs Leu 
9 ^5 1000 1005 

gaa get gag gta cgt gca aga gag aga aac aca cgc atg gga gaa cca 3171 
Glu Ala Glu Val^ Arg Ala Arg Glu Arg Asn Thr Arg Met Gly Glu Pro 
^010 1015 1020 

gaa gga cat gaa gaa ccg aga ggc gca gaa gaa tec act teg get eta 3219 
Glu Gly His Glu Glu Pro Arg Gly Ala Glu Glu Ser He Ser Ala Leu 
102S 1030 1035 1040 

ggt gac ttg ttt gca gac ctg aaa ttt gec ccc ccc gaa gag gac cct 3267 
Gly Asp Leu Phe Ala Asp Leu Lys Phe Ala Leu Ser Glu Glu Asp Pro 
1045 1050 1055 



tgg aaa gca etc gag ttt tta aag cat get egg aag att get ggc aaa 
Trp Lys Ala Phe Glu Phe Leu Lys His Ala Trp Lys He Ala Gly Lys 
1060 1065 1070 



3315 



acc aga eta aaa cca act tgt cca ttt tgatctaatc ttaacattat 3362 
lie Arg Leu Lys Pro Thr Cys Ser Phe 
1075 1080 

ageaactgea aggtcttgat catctgttag ttgegtacta acttatgtgt actagtataa 3422 

caagaaaaga gaattagaga gatggattct aatccggtgt cgcagtacat cttttctcca 3482 

cccgcataaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3 522 



<210> 19 
<211> 1081 
<212> PRT 



JSDOCID: <WO 9919492A2J_> 



WO 99/19492 



PCT/EP98/06977 



14 

<213> Arabidopsis chaliana ecotype Columbia 

<223 > Polypeptide MSH3 

<400> 19 

Met Gly Lys Gin Lys Gin Gin Thr lie Ser Arg Phe Phe Ala Pro Lys 
15 10 15 

Pro Lys Ser Pro Thr His Glu Pro Asn Pro Val Ala Glu Ser Ser Thr 
20 25 30 

Pro Pro Pro Lys lie Ser Ala Thr Val Ser Phe Ser Pro Ser Lys Arg 
35 40 45 

Lys Leu Leu Ser Asp His Leu Ala Ala Ala Ser Pro Lys Lys Pro Lys 
50 55 60 

Leu Ser Pro His Thr Gin Asn Pro Val Pro Asp Pro Asn Leu His Gin 
65 70 75 80 

Arg Phe Leu Gin Arg Phe Leu Glu Pro Ser Pro Glu Glu Tyr Val Pro 

35 90 95 

Glu Thr Ser Ser Ser Arg Lys Tyr Thr Pro Leu Glu Gin Gin Val Val 
100 105 110 

Glu Leu Lys Ser Lys Tyr Pro Asp Val Val Leu Met Val Glu Val Gly 
115 120 125 

Tyr Arg Tyr Arg Phe Phe Gly Glu Asp Ala Glu He Ala Ala Arg Val 
130 135 140 

Leu Gly He Tyr Ala His Met Asp His Asn Phe Met Thr Ala Ser Val 
145 150 155 160 

Pro Thr Phe Arg Leu Asn Phe His Val Arg Arg Leu Val Asn Ala Gly 
165" 170 175 

Tyr Lys He Gly Val Val Lys Gin Thr Glu Thr Ala Ala He Lys Ser 
180 185 190 

His Gly Ala Asn Arg Thr Gly Pro Phe Phe Arg Gly Leu Ser Ala Leu 
195 200 205 

Tyr Thr Lys Ala Thr Leu Glu Ala Ala Glu Asp He Ser Gly Gly Cys 
210 215 220 

Gly Gly Glu Glu Gly Phe Gly Ser Gin Ser Asn Phe Leu Val Cys Val 
225 230 235 240 

Val Asp Glu Arg Val Lys Ser Glu Thr Leu Gly Cys Gly He Glu Met 
245 * 250 255 
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Ser Phe Asp Val Arg Val Gly Val Val Gly Val Glu lie Ser Thr Gly 
260 265 270 

Glu Val Val Tyr Glu Glu Phe Asn Asp Asn Phe Met Arg Ser Gly Leu 
275 280 285 

Glu Ala Val lie Leu Ser Leu Ser Pro Ala Glu Leu Leu Leu Gly Gin 
290 295 300 

Pro Leu Ser Gin Gin Thr Glu Lys Phe Leu Val Ala Met Ala Gly Pro 
305 310 315 " 320 

Thr Ser Asn Val Arg Val Glu Arg Ala Ser Leu Asp Cys Phe Ser Asn 
325 330 335 

Gly Asn Ala Val Asp Glu Val lie Ser Leu Cys Glu Lys lie Ser Ala 
340 345 350 

Gly Asn Leu Glu Asp Asp Lys Glu Met Lys Leu Glu Ala Ala Glu Lys 
355 360 365 

Gly Met Ser Cys Leu Thr Val His Thr He Met Asn Met Pro His Leu 
370 375 380 

Thr Val Gin Ala Leu Ala Leu Thr Phe Cys His Leu Lys Gin Phe Gly 
385 390 395 400 

Phe Glu Arg He Leu Tyr Gin Gly Ala Ser Phe Arg Ser Leu Ser Ser 
405 410 415 

Asn Thr Glu Met Thr Leu Ser Ala Asn Thr Leu Gin Gin Leu Glu Val 
420 425 430 

Val Lys Asn Asn Ser Asp Gly Ser Glu Ser Gly Ser Leu Phe His Asn 
435 440 445 

Met Asn His Thr Leu Thr Val Tyr Gly Ser Arg Leu Leu Arg His Trp 
450 455 460 

Val Thr His Pro Leu Cys Asp Arg Asn Leu He Ser Ala Arg Leu Asp 
465 470 475 480 

Ala Val Ser Glu He Ser Ala Cys Met Gly Ser His Ser Ser Ser Gin 
485 490 495 

Leu Ser Ser Glu Leu Val Glu Glu Gly Ser Glu Arg Ala He Val Ser 
500 505 ~ 510 

Pro Glu Phe Tyr Leu Val Leu Ser Ser Val Leu Thr Ala Met Ser Arg 
515 520 525 

Ser Ser Asp He Gin Arg Gly He Thr Arg He Phe His Arg Thr Ala 
530 535 540 
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Lys Ala Thr Glu Phe He Ala Val Met Glu Ala He Leu Leu Ala Gly 
545 . 550 555 560 

Lys Gin He Gin Arg Leu Gly lie Lys Gin Asp Ser Glu Mec Arg Ser 
565 570 575 

Met Gin Ser Ala Thr Val Arg Ser Thr Leu Leu Arg Lys Leu He Ser 
580 585 590 

Val He Ser Ser Pro Val Val Val Asp Asn Ala Gly Lys Leu Leu Ser 
595 600 605 

Ala Leu Asn Lys Glu Ala Ala Val Arg Gly Asp Leu Leu Asp He Leu 
610 615 620 

He Thr Ser Ser Asp Gin Phe Pro Glu Leu Ala Glu Ala Arg Gin Ala 
625 630 635 640 

Val Leu Val He Arg Glu Lys Leu Asp Ser Ser He Ala Ser Phe Arg 
645 650 655 

Lys Lys Leu Ala He Arg Asn I,~u Glu Phe Leu Gin Val Ser Gly He 
660 665 670 

Thr His Leu lie Glu Leu Pro Val Asp Ser Lys Val Pro His Asn Trp 
675 680 685 

Val Lys Val Asn Ser Thr Lys Lys Thr He Arg Tyr His Pro Pro Glu 
690 695 700 

lie Val Ala Gly Leu Asp Glu Leu Ala Leu Ala Thr Glu His Leu Ala 
705 710 715 720 

He Val Asn Arg Ala Ser Trp Asp Ser Phe Leu Lys Ser Phe Ser Arg 
725 730 735 

Tyr Tyr Thr Asp Phe- Lys Ala Ala Val Gin Ala Leu Ala Ala Leu Asp 
740 745 750 

Cys Leu His Ser Leu Ser Thr Leu Ser Arg Asn Lys Asn Tyr Val Arg 
755 760 765 

Pro Glu Phe Val Asp Asp Cys Glu Pro Val Glu He Asn He Gin Ser 
770 775 780 

Gly Arg His Pro Val Leu Glu Thr He Leu Gin Asp Asn Phe Val Pro 
785 790 795 800 

Asn Asp Thr He Leu His Ala Glu Gly Glu Tyr Cys Gin He He Thr 
805 810 815 

Gly Pro Asn Met Gly Gly Lys Sar Cys Tyr He Arg Gin Val Ala Leu 
820 825 830 
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lie Ser He Met: Ala Gin Val Gly Ser Phe Val Pro Ala Ser Phe Ala 
335 840 845 

Lys Leu His Val Leu Asp Gly Val Phe Thr Arg Mec Gly Ala Ser Asp 
850 855 860 

Ser He Gin His Gly Arg Ser Thr Phe Leu Glu Glu Leu Ser Glu Ala 
365 870 875 880 

Ser His He He Arg Thr Cys Ser Ser Arg Ser Leu Val He Leu Asp 
885 890 895 

Glu Leu Gly Arg Gly Thr Ser Thr His Asp Gly Val Ala He Ala Tyr 
900 905 910 

Ala Thr Leu Gin His Leu Leu Ala Glu Lys Arg Cys Leu Val Leu Phe 
915 920 925 

Val Thr His Tyr Pro Glu lie Ala Glu He Ser Asn Gly Phe Pro Gly 
930 935 940 

Ser Val Gly Thr Tyr His Val Ser Tyr Leu Thr Leu Gin Lys Asp Lys 
945 950 955 960 

Gly Ser Tyr Asp His Asp Asp Val Thr Tyr Leu Tyr Lys Leu Val Arg 
965 970 975 

Gly Leu Cys Ser Arg Ser Phe Gly Phe Lys Val Ala Gin Leu Ala Gin 
980 985 990 

He Pro Pro Ser Cys He Arg Arg Ala He Ser Mec Ala Ala Lys Leu 
995 1000 1005 

Glu Ala Glu Val Arg Ala Arg Glu Arg Asn Thr Arg Met Gly Glu Pro 
1010 1015 1020 

Glu Gly His Glu Glu Pro Arg Gly Ala Glu Glu Ser He Ser Ala Leu 
1025 1030 1035 1040 

Gly Asp Leu Phe Ala Asp Leu Lys Phe Ala Leu Ser Glu Glu Asp Pro 
1045 1050 1055 

Trp Lys Ala Phe Glu Phe Leu Lys His Ala Trp Lys He Ala Gly Lys 
1060 1065 1070 

He Arg Leu Lys Pro Thr Cys Ser Phe 
1075 1080 



<210> 20 

<211> 24 

<212> DMA 

<213> Artificial sequence 
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<220> 

<223> MSH6 specific primer 638 for PCR using cDNA of Arabidopsis 

thaliana ecotype Columbia 

<400> 20 

cctccaccag gcgacgaaaa accg 24 

<210> 21 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer S81 for PCR using cDNA of Arabidopsis thaliana ecotype 
Columbia 

<400> 21 

cgccgccrtt agcaccccct cccttcac £8 

<210> 22 

<211> 3 0 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH6 specific primer S823 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 22 

gcttggcgca tctaatagaa tcatgacagg 3 0 

<210> 23 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH6 specific primer 637 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 23 

gacagcgtca gttcttcaga atgc 24 

<210> 24 
<211> 33 
<212> DMA 
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<213> Artificial sequence 



< 2 2 0 - 



MSH6 specific primer 1S9 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 



<400> 24 

atcccgggat gcagcgccag agatcgactt tgt 

<210> 25 
<211> 27 
<212> DNA 

<213> Artificial sequence 



33 



<22Q> 
<223> 



MSH6 specific primer S83 for PCR using cDNA of Arabidopsis 
thaliana- ecotype Columbia 



<40C> 25 

cgc:a:cca: ggctgccccg aaccgag 

<21C> 26 

<211> 1385 

<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 

<223> Clone 43 

< 4 0 C > 2 6 



27 



60 



cccgggatgc agcgccagag atcgattttg tctttcttcc aaaaacccac ggcggcgact 

acgaagggtt tggcttccgg cgatgctgct agcggcgggg gcggcagcgg aggaccacga 120 

cttaatgtga aggaagggga tgctaaaggc gacgcttccg tacgttttgc tgtttcgaaa 180 

tctgtcgatg aggttagagg aacggatact ccaccggaga aggttccgcg tcgtgtcctg 240 

ccgcctggat ttaagccggc tgaatccgcc ggtgatgcct cgtccctgtt ctccaatatt 300 

atgcataagt ttgtaaaagt cgatgatcga gattgttctg gagagaggag ccgagaagat 3 60 

gttgttccgc tgaatgattc atctctatgt atgaaggcca atgatgttat tcctcaattt 420 

cgtcccaata acggtaaaac tcaagaaaga aaccatgctt ttagtttcag tgggagagct 480 

gaacttagat cagtagaaga tataggagta gatggcgatg ttcctggtcc agaaacacca 540 

gggatgcgtc cacgtgcttc tcgcttgaag cgagttctgg aggatgaaat gacttttaag 600 
gaggacaagg t tcctgtatt ggactctaac aaaaggctga aaatgctcca ggatccggtt 



660 
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cgcggagaga agaaagaagt aaacgaagga 
accagggacg ccaatagaag acgtcccgat 
ataccacctg atgttttcaa gaaaatgtcc 
agtgaacata tggacatcgt gcctt tec 
ctagacgcgg aaccaggcca caaggagcct 
aaatgcagac aggttggtac ctctgaaagc 
gctcgtggat ataaagctgg acgaaccgag 
agaggtgcca acactacaac cccaaggaag 
agcgagggaa acatcgggcc tgatgccgtc 
gagctacaaa agtgttcaac cgcgcacgga 
t: ct ~ggg~ ~ g ggcccaccag cgacgacgca 
caggcccccc caaaggaagc gccaca-gac 
gccccaagga aatatacgc: gacagggticc 
gtaacggggg acacagatgc tgctggagtc 
aaaggctccc ccgaaccatg gaactgcgcc 
ctcagcgccc tcggagagct aaccaaccac 
aagcacgggg atatttttcc ataccaagct 
acgacggtaa atcttgagat atttaacaac 
cacaaacacc tcgataactg tgttagtcca 
cgccacccac tcaaagatgt agaaagcatc 
acggcaaacc cagaaagtat gcaaatcacc 
gaaagactgc tcggacgcat caagtccagc 
cttccgggga aaaaagtgct gaaacaacga 
ttcagaagcg gaactgatcc gttgccggcc 
ctctataaac tctgtaaact tcctatacca 
tctcaacccg aagcagccac agacagcg 
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accaaacccg aacggcttga gtcttctcga 720 

gatccccttt acgatagaaa gaccttacac 730 

gcatcacaaa agcaacactg gagtgttaag 840 
aaagtgggga aatcctatga gccgtatgag 900 
gactggaaga tgaccacgag tggtgtggga 96 0 

gggacagatg aggcagcgca aaagctatta 1020 

cagctagaaa catctgacca agcaaaagcc 1080 

ctagctcagg tattaactcc atcaacagca 1140 

caccctcttg ccacaaaaga gatcaaaacg 1200 

crcgcctttg tccjactgtgc cgccctgagg 1250 

ccatctgccg cccccggagc gtcactgacg 1320 

agcaaagggc tatcaagaga agcacaaaag 13 30 

acggcggcac agtcggctcc agtaccacaa 1440 

agaaatataa tagaatccaa cggacaccnt 1500 

gttgatggcc taaatgaatg cgatgccgcc 1550 

ccgrccaggc caaagctaga agatgtacct 15 20 

tacaggggtc gcctcagaat cgatggccag 15 80 

agccgcgatg gcggtccttc agggacctcg 1740 

actggtaagc gacccttaag gaactggacc 18 00 

aacaaacggc ccgatgtagc tgaagaattc i860 

ggccagcacc tccacaaact cccagactta 1920 

gttcgatcac cagcctctgt gttgcccgct 1980 

gttaaagcat ttgggcaaat tgtgaaaggg 2 040 

ctacagaagg aaccaaatat gacgagttcg 2100 

gtaggaaaaa gcgggctaga gttacttctc 2160 

2188 
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<210> 27 
<211> 1385 
<212> DMA 

<213> Arabidopsis chaliana ecotype Columbia 

<223> Clone 62 

<400> 27 

catcagcctc cgcgtcgcct gctcttctgg ggaaaaaagc gccgaaacaa cgagccaaag 60 

cacctgggca aactgcgaaa gggttcagaa gcggaattga cccgccgttg gctccacaga 120 

aggaaccaaa taegacgagt ttgctccata aaccctgtaa acctcccata ttagtaggaa ISO 

aaagcgggcc agagccatcc ctucccaat ccgaagcagc cacagacagc gaccttccaa 240 

attaccagaa ccaagacgeg acagacgaaa acgctgaaac tcccacaata cctatcgaac 3 00 

trttcaccga aagagcaact caatggtctg aggccaccca caccataagc cgcccagacg 360 

ccccgagacc ttccgcaatc gcagcaagcc tccccgccgg aagcacggcc aggcctgtca 42*0 

ccccccccga atcagaagct acaga=caga accagaaaac aaaagggcca acacccaaaa 480 

cccaaggacc acggcaccca cttgcagccg cagccgacgg ccaaccgcct gccccgaacg 540 

acacaccccc cggcgaggcc agaagaagca gcggcagcac ecacccccgg ccactgctac 600 

cgacgggacc aaacacgggc ggaaaaccaa ctccccctcg tgcaacaegt ccggccgcca 660 

cctccgccca actcggccgc cacgtgccgc gcgagtctcg cgaaaccccc ctcgtggata 720 

ccacccccac aaggcccggc gcatctgata gaatcacgac aggagagagc accctcccgg 780 

cagaacgcac cgagacagcg ccagctcctc agaacgcaac ccaggaccca ctagcaatcc 840 

ccgacgaact gggcagagga actagtactt tcgatggaca cgccaccgca cacccggctt 900 

cecgccaccc ggtagagaaa gttcaatgcc ggatgcccct cgcaacacac taccaccccc 960 

ccaccaagga attcgcgtcc cacccacgtg tcaccccgaa acacatggcc tgcgcaccca 1020 

aaccaagatc tgattatcaa ccacgcggcc gcgaccaaga cccagtgctc ttgtaccgcc 1080 

taaccgaggg agcttgtccc gagagccacg gacttcaagc ggcactcatg gccggaacac 1140 

caaaccaagt ggttgaaaca gcatcaggcg ctgctcaagc catgaagaga tcaatcgggg 1200 

aaaacttcaa gtcaagcgag ctaagatctg agccctcaag cccgcacgaa gaccggccca 1260 

agtcatcggc gggtacctcc cgagtcgccc acaacaacgc ccccaccggc gaagatgacc 13 2 0 

acgacacctt gttttgccc.i tggcacgaga tcaaaccccc ceactgtgtt cccaaacaac 1380 
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ccggg 13 8 5 

<210> 28 

<211> 34 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH6 specific primer 2S8 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 26 

atcccgggtt atttgggaac acagtaagag gatt 34 

<210> 29 

<211> 27 

<212> DNA 

<213> Artificial sequence » 
<22C> 

<223> MSH6 specific primer S82 fcr PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 29 

gcgttcgatc atcagcctct gtgttgc 27 

<210> 30 

<211> 3606 

<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 
<220> 

<221> CDS 

<222> (142) (3468) 

<223> ACMSH6 full-length cDNA and deduced sequence of the encoded 
polypeptide 

<400> 30 

aaaagttgag ccctgaggag tatcgtttcc gccatttcta cgacgcaagg cgaaaatttt 6 0 

tggcgccaat ctttcccccc tttcgaattc tctcagctca aaacatcgtt tctctctcac 120 

tctctctcac aattccaaaa a atg cag cgc cag aga teg att ttg tct ttc 171 

Met Gin Arg Gin Arg Ser lie Leu Ser Phe 
15 10 
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tec caa aaa 
Phe Gin Lys 



get get age 
Ala Ala Ser 



gaa ggg gat 
Glu Gly Asp 
45 

tec gtc gat 
Ser Val Asp 
60 

cgt cgt gtc 
Arg Arg Val 

75 

get teg ccc 
Ala Ser Ser 



gat cga gat 
Asp Arg Asp 



aat gat tea 
Asn Asp Ser 
125 

egt tec aat 
Arg Ser Asn 
140 

agt ggg aga 
Ser Gly Arg 
155 

gat gtt cct 
Asp Val Pro 



ttg aag cga 
Leu Lys Arg 



cct gta ttg 
Pro Val Leu 
205 



ccc acc gcg 
Pro Thr Ala 
15 

ggc ggg ggc 
Gly Gly Gly 
30 

get aaa ggc 
Ala Lys Gly 



gag gtt aga 
Glu Val Arg 



ctg ccg tct 
Leu Pro Ser 
80 

ctg ttc tec 
Leu Phe Ser 
95 

tgt tct gga 
Cys Ser Gly 
110 

tct eta . tgt 
Ser Leu Cys 



aat ggt aaa 
Asn Gly Lys 



get gaa ctt 
Ala Glu Leu 
160 

ggc cca gaa 
Gly Pro Glu 
175 

gtt ctg gag 
Val Leu Glu 
190 

gac tct aac 
Asp Ser Asn 



gcg act acg 
Ala Thr Thr 



ggc age gga 
Gly Ser Gly 
35 

gac get tct 
Asp Ala Ser 
50 

gga acg gat 
Gly Thr Asp 
65 

gga ttt aag 
Gly Phe Lys 



aat a:: atg 
Asn :i~ Met 



gag agg age 
Glu Arg Ser 
115 

atg aag get 
Met Lys Ala 
130 

act caa gaa 
Thr Gin Glu 
145 

aga tea gta 
Arg Ser Val 



aca cca ggg 
Thr Pro Gly 



gat gaa atg 
Asp Glu Met 
195 

aaa agg ctg 
Lys Arg Leu 

;uo 



23 

aag ggt ttg 

Lys Gly Leu 
20 

gga cca cga 

Gly Pro Arg 



gta cgt ttt 
Val Arg Phe 



act cca ccg 
Thr Pro Pro 
70 

ccg get gaa 
Pro Ala Glu 
85 

cat aag ttt 
His Lys Phe 
100 

cga gaa gat 
Arg Glu Asp 



aat gat gtt 
Asn Asp Val 



aga aac cat 
Arg Asn His 
150 

gaa gat ata 
Glu Asp lie 
165 

atg cgt cca 
Met Arg Pro 
180 

act ttt aag 
Thr Phe Lys 



aaa atg etc 
Lys Met Leu 



gtt tec ggc 
Val Ser Gly 
25 

ttt aat gtg 
Phe Asn Val 
40 

get gtt teg 
Ala Val Ser 
55 

gag aag gtt 
Glu Lys Val 



tec gec gst 
Ser Ala Gly 



gta aaa gtc 
Val Lys Val 
105 

gtt gtt ccg 
Val Val Pro 
120 

att cct caa 
lie Pro Gin 
135 

get ttt agt 
Ala Phe Ser 



gga gta gat 
Gly Val Asp 



cgt get tct 
Arg Ala Ser 
185 

gag gat aag 
Glu Asp Lys 
200 

cag gat ccg 
Gin Asp Pro 
215 



gat 219 
Asp 



aag 267 
Arg 



aaa 315 
Lys 



ccg 363 
Pro 



gat 411 
Asp 

90 

* 

gat 459 
Asp 



ctg 507 
Leu 



ttt 555 
Phe 



ttc 603 
Phe 



ggc 6 51 

Gly 

170 

cgc 699 
Arg 



gtt 747 
Val 



gtt 795 
Val 
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cgt gga gag aag aaa gaa gca aac gaa gga acc aaa tec gaa egg etc 843 

Cys Gly Giu Lys Lys Glu Val Asn Glu Gly Thr Lys Phe Glu Trp Leu 

220 225 230 

gag ccc cc: cga acc agg gac gec aac aga aga cgc ccc gac gac ccc 891 

Glu Ser Ser Arg lie Arg Asp Ala Asn Arg Arg Arg Pro Asp Asp Pro 
235 240 245 250 

ctt tac gac aga aag acc cca cac aca cca ccc gat gtt ttc aag aaa 93 9 

Leu Tyr Asp Arg Lys Thr Leu His lie Pro Pro Asp Val Phe Lys Lys 
255 260 265 

acg tct gca tea caa aag caa tat tgg age gtt aag agt gaa tat atg 987 

Met Ser Ala Ser Gin Lys Gin Tyr Trp Ser Val Lys Ser Glu Tyr Met 

270 275 280 



gac att gcg ctt ttc ttt aaa gtg ggg aaa ttt tat gag ctg tat gag 
Asp lie Val Leu Phe Phe Lys Val Gly Lys Phe Tyr Glu Leu Tyr Glu 
295 290 295 



age gag gga aac ate ggg cct gat gee gtc cat ctt ctt get ata aaa 
Ser Glu Gly Asn lie Gly Pro Asp Ala Val His Leu Leu Ala lie Lys 
380 385 390 

gag ate aaa atg gag eta caa aag tgt tea act gtg tat gga ttt get 
Glu lie Lys Met Glu Leu Gin Lys Cys Ser Thr Val Tyr Gly Phe Ala 
395 400 405 410 

ttt gtt gac tgc get gee ttg agg ttt tgg gtt ggg tec ate age gat 
Phe Val Asp Cys Ala Ala Leu Arg Phe Trp Val Gly Ser lie Ser Asp 
415 420 425 



1035 



cca gac gcg gaa cca ggc cac aag gag ccc gac egg aag acg acc acg 108? 

Leu Asp Ala Glu Leu Gly His Lys Glu Leu Asp Trp Lys Mec Thr Met 

300 305 310 

agt ggc gcg gga aaa cgc aga cag gec ggc acc ccc gaa age ggg aca 1131 

Ser Gly Val Gly Lys Cys Arg Gin Val Gly lie Ser Glu Ser Gly lie 
315 320 325 330 

gac gag gca gcg caa aag cca cca gee cge gga tat aaa gtt gga cga 1179 

Asp Glu Ala Val Gin Lys Leu Leu Ala Arg Gly Tyr Lys Val Gly Arg 
335 340 345 

acc gag cag cca gaa aca ccc gac caa gca aaa gec aga ggt get aac 1227 

He Glu Gin Leu Glu Thr Ser Asp Gin Ala Lys Ala Arg Gly Ala Asn 
350 355 360 

act aca act cca agg aag eta gtt cag gta tta act cca tea aca gca 1275 
Thr lie He Pro Arg Lys Leu Val Gin Val Leu Thr Pro Ser Thr Ala 
365 370 375 



1323 



1371 



1419 
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gac gca cca tgt get get etc gga gcg tea ttg atg cag gtt tec cca 1467 

Asp Ala Ser Cys Ala Ala Leu Gly Ala Leu Leu Met Gin Val Ser Pro 
430 435 440 

aag gaa gcg tta tat gac agt aaa ggg eta tea aga gaa gca caa aag 1515 
Lys Glu Val Leu Tyr Asp Ser Lys Gly Leu Ser Arg Glu Ala Gin Lys 
445 450 455 

get eta agg aaa tat acg ttg aca ggg tct acg gcg gta cag ttg get 1563 
Ala Leu Arg Lys Tyr Thr Leu Thr Gly Ser Thr Ala Val Gin Leu Ala 
460 465 470 

cca gta cca caa gta atg ggg gat aca gat get get gga gtt aga aat 1611 
Pro Val Pro Gin Val Met Gly Asp Thr Asp Ala Ala Gly Val Arg Asn 
475 480 485 490 

ata ata gaa tct aac gga tac ttt aaa ggt tct tct gaa tea tgg aac 1659 
lie lie Glu Ser Asn Gly Tyr Phe Lys Gly Ser Ser Glu Ser Trp Asn 
495 500 505 

tgt get gtt gat ggt eta aat gaa tgt gat gtt gee ctt agt get ctt 1707 
Cys Ala Val Asp Gly Leu A sr. Glu Cys Asp Val Ala Leu Ser Ala Leu 
510 515 520 

gga gag eta att aat cat ctg cct agg eta aag eta gaa gat gta ctt 1755 
Gly Glu Leu lie Asn His Leu Ser Arg Leu Lys Leu Glu Asp Val Leu 
525 530 535 

aag cat ggg gat att ttt cca tac caa gtt tac agg ggt tgt etc aga 1803 
Lys His Gly Asp lie Phe Pro Tyr Gin Val Tyr Arg Gly Cys Leu Arg 
540 545 550 

act gac ggc cag acg atg gta aat ctt gag aca ttt aac aat age tgt 1851 
lie Asp Gly Gin Thr Met Val Asn Leu Glu lie Phe Asn Asn Ser Cys 
555 560 565 570 

gat ggt ggt cct tea ggg ace ttg tac aaa tat ctt gat aac tgt gtt 1899 
Asp Gly Gly Pro Ser Gly Thr Leu Tyr Lys Tyr Leu Asp Asn Cys Val 
575 580 585 

agt cca act ggt aag cga etc tta agg aat tgg ate tgc cat cca etc 1947 
Ser Pro Thr Gly Lys Arg Leu Leu Arg Asn Trp lie Cys His Pro Leu 
590 595 600 

aaa gat gta gaa age ate aat aaa egg ctt gat gta gtt gaa gaa ttc 1995 
Lys Asp Val Glu Ser lie Asn Lys Arg Leu Asp Val Val Glu Glu Phe 
605 610 615 

acg gca aac tea gaa agt atg caa ate act ggc cag tat etc cac aaa 204 3 

Thr Ala Asn Ser Glu Ser Met Gin lie Thr Gly Gin Tyr Leu His Lys 
620 625 630 
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etc cca gac tta gaa aga ctg etc gga cgc ate aag tec age get cga 2091 
Leu Pro Asp Leu Glu Arg Leu Leu Gly Arg lie Lys Ser Ser val Arg 
635 640 645 650 

tea tea gee tct gtg ttg cct get ctt ctg ggg aaa aaa gtg etg aaa 2139 
Ser Ser Ala Ser Val Leu Pro Ala Leu Leu Gly Lys Lys Val Leu Lys 
655 660 665 

caa cga gtc aaa gca ttt ggg caa att gtg aaa ggg ttc aga agt gga 2187 
Gin Arg Val Lys Ala Phe Gly Gin He Val Lys Gly Phe Arg Ser Gly 
670 675 680 

att gat ctg ttg ttg get eta cag aag gaa tea aat atg atg agt ttg 223 5 

He Asp Leu Leu Leu Ala Leu Gin Lys Glu Ser Asn Met Met Ser Leu 
685 690 695 

ctt tat aaa etc tgt aaa ctt cct ata tta gta gga aaa age ggg eta 2283 
Leu Tyr Lys Leu Cys Lys Leu Pro He Leu Val Gly Lys Ser Gly Leu 
700 705 710 

gag tta ttt ctt tct caa ttc gaa gca gec ata gat age gac tct cca 233f 
Glu Leu Phe Leu Ser Gin Phe Glu Ala Ala He Asp Ser Asp Phe Pro 
715 720 725 730 

aat tat cag aac caa gat gtg aca gac gaa aac gcz gaa act etc aca ?, 379 
Asn Tyr Gin Asn Gin Asp Val Thr Asp Glu Asn Ala Glu Thr Leu Thr 
735 740 745 

ata ctt ate gaa ctt ttt ate gaa aga gca act caa tgg tct gag gtc 2427 
He Leu He Glu Leu Phe He Glu Arg Ala Thr Gin Trp Ser Glu Val 
750 755 760 

act cac ace ata age tgc eta gac gec ccg aga ccc ccc gca acc gca 2475 
He His Thr He Ser Cys Leu Asp Val Leu Arg Ser Phe Ala lie Ala 
765 770 775 

gca agt etc tct get gga age atg gee agg cct get att ttt ccc gaa 2523 
Ala Ser Leu Ser Ala Gly Ser Met Ala Arg Pro Val He Phe Pro Glu 
780 785 790 

tea gaa get aca gac cag aat cag aaa aca aaa ggg cca ata ctt aaa 2 571 

Ser Glu Ala Thr Asp Gin Asn Gin Lys Thr Lys Gly Pro He Leu Lys 
795 800 805 810 

ate caa gga eta tgg cat cca ttt gca gtt gca gee gat ggt caa ttg 2619 
He Gin Gly Leu Trp His Pro Phe Ala Val Ala Ala Asp Gly Gin Leu 
815 820 825 

cct gtt ccg aac gat ata etc ctt ggc gag get aga aga age agt ggc 2667 
Pro Val Pro Asn Asp He Leu Leu Gly Glu Ala Arg Arg Ser Ser Gly 
830 835 840 
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age ate cac cct egg tea ttg cca ccg acg gga cca aac acg ggc gga 2715 

Ser He His Pro Arg Ser Leu Leu Leu Thr Gly Pro Asn Met Gly Gly 
845 350 855 

aaa cca act cct etc cgt gca aca cgc ctg gec gec ace tec gee caa 2763 

Lys Ser Thr Leu Leu Arg Ala Thr Cys Leu Ala Val He Phe Ala Gin 
860 865 870 

cct ggc cgc cac gtg ccg cgc gag cct tgc gaa ate tec etc gtg gat 2811 

Leu Gly Cys Tyr Val Pro Cys Glu Ser Cys Glu He Ser Leu Val Asp 

875 880 885 890 

act ate ttc aca agg ctt ggc gca tec gat aga ate atg aca gga gag 2859 

Thr He Phe Thr Arg Leu Gly Ala Ser Asp Arg He Met Thr Gly Glu 
895 900 905 

agt acc ttt ttg gta gaa tgc act gag aca gcg tea gtt ctt cag aat 2907 

Ser Thr Phe Leu Val Glu Cys Thr Glu Thr Ala Ser Val Leu Gin Asn 
910 915 920 

gca acc cag gat tea eta gca acc etc gac gaa ccg ggc aga gga acc 295^ 

Ala Thr Gin Asp Ser Leu Val He Leu Asp Glu Leu Gly Arg Gly Thr 
925 930 935 

agt act ccc gac gga cac gee acc gca cac teg get ttt cgt cac ctg 3003 

Ser Thr Phe Asp Gly Tyr Ala He Ala Tyr Ser Val Phe Arg His Leu 
940 945 950 

gta gag aaa gtt caa tgt egg atg etc ttt gca aca cat tac cac cct 3051 

Val Glu Lys Val Gin Cys Arg Met Leu Phe Ala Thr His Tyr His Pro 

955 960 965 970 

etc acc aag gaa ttc gcg tct cac cca cgc gee acc ccg aaa cac acg 3099 

Leu Thr Lys Glu Phe Ala Ser His Pro Arg Val Thr Ser Lys His Met 
975 980 985 

get tgc gca ttc aaa* tea aga tct gat tat caa cca cgt ggt tgt gat 3147 

Ala Cys Ala Phe Lys Ser Arg Ser Asp Tyr Gin Pro Arg Gly Cys Asp 
990 995 1000 

caa gac eta gtg ttc ttg tac cgt tta acc gag gga get tgt cct gag 3195 

Gin Asp Leu Val Phe Leu Tyr Arg Leu Thr Glu Gly Ala Cys Pro Glu 
1005 1010 1015 

age tac gga ctt caa gtg gca etc atg get gga ata cca aac caa gtg 3243 

Ser Tyr Gly Leu Gin Val Ala Leu Met Ala Gly He Pro Asn Gin Val 
1020 1025 1030 

gtt: gaa aca gca tea ggt get get caa gee atg aag aga tea att ggg 3291 

Val Glu Thr Ala Ser Gly Ala Ala Gin Ala Met Lys Arg Ser He Gly 

1035 1040 1045 1050 
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gga aac ttc aag tea age gag cza aga tec gag tec tea age ctg cat 3339 

Glu Asn Phe Lys Ser Ser Glu Leu Arg Ser Glu Phe Ser Ser Leu His 
10S5 1060 10 65 

gaa gac tgg etc aag tea ttg gcg ggt att tct cga gtc gec cac aac 3387 
Glu Asp Trp Leu Lys Ser Leu Val Gly lie Ser Arg Val Ala His Asn 
1070 1075 1080 

aat gec ccc act ggc gaa gac gac tac gac act ttg ttt tgc tta tgg 343 5 

Asn Ala Pro lie Gly Glu Asp Asp Tyr Asp Thr Leu Phe Cys Leu Trp 
1085 1090 1095 

cat gag ate aaa tec tct tac tgc gtc ccc aaa taaatggcta 3478 
His Glu lie Lys Ser Ser Tyr Cys Val Pro Lys 
1100 1105 

tgacataaca etatctgaag ctegtcaagt cttttgcctc tctgatgttt attcctctta 3538 
aaaaatgett atatatcaaa aaaccg:::c crcgattaaa aaaaaaaaaa aaaaaaaaaa 3598 
aaaaaaaa 3 6 0f> 

<210> 31 

<211> 1109 

<212> PRT *' 

<213> Arabidopsis thaliana ecccype Columbia 

<223> Polypeptide MSH6 

<400> 31 

Met Gin Arg Gin Arg Ser He Leu Ser Phe Phe Gin Lys Pro Thr Ala 
1 5 10 15 

Ala Thr Thr Lys Gly Leu Val Ser Gly Asp Ala Ala Ser Gly Gly Gly 
20 25 30 

Gly Ser Gly Gly Pro Arg Phe Asn Val Arg Glu Gly Asp Ala Lys Gly 
35 40 45 

Asp Ala Ser Val Arg Phe Ala Val Ser Lys Ser Val Asp Glu Val Arg 
50 55 60 

Gly Thr Asp Thr Pro Pro Glu Lys Val Pro Arg Arg Val Leu Pro Ser 
65 70 75 80 

Gly Phe Lys Pro Ala Glu Ser Ala Gly Asp Ala Ser Ser Leu Phe Ser 

85 90 95 

Asn He Met His Lys Phe Val Lys Val Asp Asp Arg Asp Cys Ser Gly 
100 105 110 

Glu Arg Ser Arg Glu Asp Val Val Pro Leu Asn Asp Ser Ser Leu Cys 
115 120 125 



WO 9919492A2J_> 



WO 99/19492 PCT/EP98/0697? ^ 



29 



Mec Lys Ala Asn Asp Val He Pro Gin Phe Arg Ser Asn Asn Gly Lys 
130 135 140 



Thr Gin Glu Arg 
145 

Arg Ser Val Glu 



Thr Pro Gly Met 
180 

Asp Glu Met Thr 
195 

Lys Arg Leu Lys 
210 

Val Asn Glu Gly 
225 

Asp Ala Asn Arg 



Asn His Ala Phe 
150 

Asp He Gly Val 
165 

Arg Pro Arg Ala 



Phe Lys Glu Asp 
200 

Met Leu Gin Asp 
215 

Thr Lys Phe Glu 
230 

Arg Arg Pro Asp 
245 



Ser Phe Ser Gly 
155 

Asp Gly Asp Val 
170 

Ser Arg Leu Lys 
185 

Lys Val Pro Val 



Pro Val Cys Gly 
220 

Trp Leu Glu Ser 
235 

Asp Pro Leu Tyr 
250 



Arg Ala Glu Leu 
160 

Pro Gly Pro Glu 
175 

Arg Val Leu Glu 
190 

Leu Asp Ser Asn 
205 

Glu Lys Lys Glu 



Ser Arg He Arg 
240 

Asp Arg Lys Thr 
255 



Leu His He Pro Pro Asp Val Phe Lys Lys Met Ser Ala Ser Gin Lys 
260 265 270 

Gin Tyr Trp Ser Val Lys Ser Glu Tyr Met Asp He Val Leu Phe Phe 
275 280 285 

Lys Val Gly Lys Phe Tyr Glu Leu Tyr Glu Leu Asp Ala Glu Leu Gly 
290 295 300 



His Lys Glu Leu Asp 
305 

Arg Gin Val Gly He 

325 

Leu Leu Ala Arg Gly 
340 

Ser Asp Gin Ala Lys 
355 

Leu Val Gin Val Leu 
370 

Pro Asp Ala Val His 
385 

Gin Lys Cys Ser Thr 

405 



Trp Lys Met Thr Met Ser 
310 315 

Ser Glu Ser Gly He Asp 
330 

Tyr Lys Val Gly Arg He 
345 

Ala Arg Gly Ala Asn Thr 
360 

Thr Pro Ser Thr Ala Ser 
375 

Leu Leu Ala He Lys Glu 
390 395 

Val Tyr Gly Phe A>\a Phe 
4^0 



Gly Val Gly Lys Cys 

320 

Glu Ala Val Gin Lys 
335 

Glu Gin Leu Glu Thr 
350 

He He Pro Arg Lys 
365 

Glu Gly Asn He Gly 
380 

He Lys Met Glu Leu 
400 

Val Asp Cys Ala Ala 
415 
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Leu Arg Phe Trp Val Gly Ser lie Ser Asp Asp Ala Ser Cys Ala Ala 
420 425 430 

Leu Gly Ala Leu Leu Met Gin Val Ser Pro Lys Glu Val Leu Tyr Asp 
435 440 445 

Ser Lys Gly Leu Ser Arg Glu Ala Gin Lys Ala Leu Arg Lys Tyr Thr 
4 50 455 460 

Leu Thr Gly Ser Thr Ala Val Gin Leu Ala Pro Val Pro Gin Val Met 
465 470 475 480 

Gly Asp Thr Asp Ala Ala Gly Val Arg Asn He He Glu Ser Asn Gly 
485 490 495 

Tyr Phe Lys Gly Ser Ser Glu Ser Trp Asn Cys Ala Val Asp Gly Leu 
500 505 510 

Asn Glu Cys Asp Val Ala Leu Ser Ala Leu Gly Glu Leu He Asn His 
515 520 525 

Leu Ser Arg Leu Lys Leu Glu Asp Val Leu Lys His Gly Asp He Phe 
530 535 540 

Pro Tyr Gin Val Tyr Arg Gly Cys Leu Arg He Asp Gly Gin Thr Met 
545 550 555 560 

Val Asn Leu Glu He Phe Asn Asn Ser Cys Asp Gly Gly Pro Ser Gly 
565 570 575 

Thr Leu Tyr Lys Tyr Leu Asp Asn Cys Val Ser Pro Thr Gly Lys Arg 
580 585 590 

Leu Leu Arg Asn Trp He Cys His Pro Leu Lys Asp Val Glu Ser He 
595 600 605 

Asn Lys Arg Leu Asp Val Val Glu Glu Phe Thr Ala Asn Ser Glu Ser 
610 615 620 

Met Gin He Thr Gly Gin Tyr Leu His Lys Leu Pro Asp Leu Glu Arg 
625 630 635 640 

Leu Leu Gly Arg He Lys Ser Ser Val Arg Ser Ser Ala Ser Val Leu 
645 650 655 

Pro Ala Leu Leu Gly Lys Lys Val Leu Lys Gin Arg Val Lys Ala Phe 
660 665 " 670 

Gly Gin He Val Lys Gly Phe Arg Ser Gly He Asp Leu Leu Leu Ala 

675 680 .... 685 

Leu Gin Lys Glu Ser Asn Met Met Ser Leu Leu Tyr Lys Leu Cys Lys 
690 695 700 
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Leu Pro He Leu Val Gly Lys Ser Gly Leu Glu Leu Phe Leu Ser Gin 
705 710 715 720 

Phe Glu Ala Ala He Asp Ser Asp Phe Pro Asn Tyr Gin Asn Gin Asp 
7 25 730 735 

Val Thr Asp Glu Asn Ala Glu Thr Leu Thr He Leu He Glu Leu Phe 
7 40 745 750 

He Glu Arg Ala Thr Gin Trp Ser Glu Val He His Thr He Ser Cys 
755 760 765 

Leu Asp Val Leu Arg Ser Phe Ala He Ala Ala Ser Leu Ser Ala Gly 
770 775 780 

Ser Met Ala Arg Pro Val He Phe Pro Glu Ser Glu Ala Thr Asp Gin 
785 790 795 800 

Asn Gin Lys Thr Lys Gly Pro He Leu Lys He Gin Gly Leu Trp His 
80S 810 815 

Pro Phe Ala Val Ala Ala Asp Gly Gin Leu Pro Val Pro Asn Asp He 
820 825 830 

Leu Leu Gly Glu Ala Arg Arg Ser Ser Gly Ser He His Pro Arg Ser 
835 840 845 

Leu Leu Leu Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Leu Leu Arg 
B50 855 860 

Ala Thr Cys Leu Ala Val He Phe Ala Gin Leu Gly Cys Tyr Val Pro 
365 870 875 880 

Cys Glu Ser Cys Glu He Ser Leu Val Asp Thr He Phe Thr Arg Leu 
885- 890 895 

Gly Ala Ser Asp Arg He Met Thr Gly Glu Ser Thr Phe Leu Val Glu 
900 905 910 

Cys Thr Glu Thr Ala Ser Val Leu Gin Asn Ala Thr Gin Asp Ser Leu 
915 920 925 

Val He Leu Asp Glu Leu Gly Arg Gly Thr Ser Thr Phe Asp Gly Tyr 
930 935 940 

Ala He Ala Tyr Ser Val Phe Arg His Leu Val Glu Lys Val Gin Cys 
945 950 955 960 

Arg Met Leu Phe Ala Thr His Tyr His Pro Leu Thr Lys Glu Phe Ala 
965 970 975 

Ser His Pro Arg Val Thr Ser Lys His Met Ala Cys Ala Phe Lys Ser 
980 985 990 



snnrtrv ooioiooa? i 
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Arg Ser Asp Tyr Gin Pro Arg Gly Cys Asp Gin Asp Leu Val Phe Leu 
995 1000 1005 

Tyr Arg Leu Thr Glu Gly Ala Cys Pro Glu Ser Tyr Gly Leu Gin Val 
1010 1015 1020 

Ala Leu Met Ala Gly lie Pro Asn Gin Val Val Glu Thr Ala Ser Gly 
1025 1030 1035 1040 

Ala Ala Gin Ala Met Lys Arg Ser He Gly Glu Asn Phe Lys Ser Ser 
1045 1050 1055 

Glu Leu Arg Ser Glu Phe Ser Ser Leu His Glu Asp Trp Leu Lys Ser 
1060 1065 1070 

Leu Val Gly He Ser Arg Val Ala His Asn Asn Ala Pro He Gly Glu 
1075 1080 1085 

Asp Asp Tyr Asp Thr Leu Phe Cys Leu Trp His Glu He Lys Ser Ser 

1090 1095 1100 » 

Tyr Cys Val Pro Lys 
1105 



<210> 32 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of ATHGENEA 
microsatellite 

<400> 32 

accatgcata gcttaaac'tt cttg 

<210> 33 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of ATHGENEA 
microsatellite 

<400> 33 
acataaccac aaataggggt gc 



24 



22 
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<210> 
<211> 
<212> 
< 2 13 > 


34 
18 
DNA 

rti.i.iiiLiai sequence 


<220> 
<223> 


Forward primer DMCIN-A for PCR 
thaliana ssp. Landsberg erecta 


<400> 


34 


/T .3 «T *- -J f- 


tg ttcgtg 


<210> 

< 2 1 1 > 
<212> 

< 4i X J > 


35 

1 R 
x o 

DNA 

Artificial sequence 


<220> 
<223> 


Reverse primer DMCIN-B for PCR 
thaliana ssp. Landsberg erecta 


<400> 


35 




aaca:tcc 


<210> 
<211> 
<212> 
^ x j ~> 


36 
DNA 

Artificial sequence 


<220> 
<223> 


Forward primer DMCIN-1 for PCR 
thaliana ssp. Landsberg erecta 


<400> 


36 




xcagctatga gattactcgt g 


<210> 
<211> 
<212> 
<213> 


37 
29 
DNA 

Artificial sequence 


<220> 
<223> 


Reverse primer DMCIN-2 for PCR < 
thaliana ssp. Landsberg erecta 


<400> 


37 



18 



18 



31 



gctctagatt tcccgctcta agactctct 29 
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< 2 10 > 

< 2 1 1 > 
<212> 
<213> 


3 8 
32 
DNA 

Arc if icial sequence 


- 


<220> 
<223 > 


Forward primer DMCIN-3 for PCR 
chaliana ssp. Landsberg erecta 


on genomic DNA of Arabidopsis 
" Ler " 


<400> 


38 




gctctagagc 


ctctcttaag taagtgattg at 


32 


<210> 
<211> 
<212> 
<213> 


39 
48 
DNA 

Art if icial sequence 




<220> 
< 223 > 


Reverse primer DMCIN- 4 for PCR 
-haliana ssp. Landsberg erecta 


m 

on genomic DNA of Arabidopsis 
"Ler" 


<400> 


39 




tcccccgggc 


tcgagagatc tccacggccc ccccagctct 


acgaaccc 48 


< 2 10 > 
<211> 
<212> 
<213> 


40 
26 
DNA 

Artificial sequence 




<220> 
< 223 > 


Forward primer DMCla for PCR on genomic DNA of Arabidopsis 
thaliana ssp. Landsberg erecta "Ler" 


<400> 


40 




acgcgccgac 


gaattcgcaa gtgggg 


26 


<210> 
<211> 
<212> 
<213> 


41 
38 
DNA 

Art i f i c i a 1 sequence 




<220> 
<223> 


Reverse primer DMClb for PCR on 
chaliana ssp. Landsberg erecta 


l genomic DNA of Arabidopsis 
"Ler- 


<400> 


41 
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38 



tccatggaga tcccccgggt accgatctgc tccgaggg 

<210> 42 

<211> 20 

<2X2> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of ATEAT1 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 42 

gccactgcgt gaatgatatg 2Q 



<210> 43 

<211> 22 

<212> DNA 9 

<213> Artificial sequence 

<220> 

<223> Reverse primer for PCR amplification of ATEAT1 SSLP marker in 

Arabidopsis thaliana subspecies 

<400> 43 

cgaacagcca acattaattc cc 22 

<210> 44 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA63 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 44 

aaccaaggca cagaagcg 18 

<210> 45 

<211> is 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA63 SSLP marker in 
Arabidopsis thaliana subspecies 
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<400> 












acccaagcga 


ccgccacc 








18 


<210> 


46 












2 1 










<212> 


DNA 










<213> 


Artificial sequence 










<220> 












<223> 


Forward primer for PCR amplification 
Arabidopsis thaliana subspecies 


of 


NGA24 8 


SSLP 


marker in 


<400> 


46 










taccgaacca 


aaacacaaag g 








21 


<210> 


47 










<2 11 > 


22 










<212> 


DNA 










<213> 


Artificial sequence 










<220> 












<223> 


Reverse primer for PCR amplification 
Arabidopsis thaliana subspecies 


of 


NGA24 8 


SSLP 


marker in 


<400> 


47 










t ctgtatctc 


ggtgaattct cc 








22 


<210> 


48 










< 2 1 1 > 


22 










<212> 


DNA 










<213> 


Artificial sequence 










<220> 












<223> 


Forward primer for PCR amplification 
Arabidopsis thaliana subspecies 


of 


NGA12 8 


SSLP 


marker in 


<400> 


48 










ggtctgttga 


tgtcgtaagt eg 








22 


<210> 


49 










<211> 


22 










<212> 


DNA 











<213> Artificial sequence 

<220> 
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<223> Reverse 

Arabidopsis thaliana subspecies 

<400> 49 



?f^ me f f° r PCR am P lif Nation of NGA128 SSLP marker in 



accttgaaac ccttagggag gg 

<210> so 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> ^° r ^ d P rimer for pc * amplification of NGA280 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 50 
ctgatctcac ggacaatagt gc 



■22 



<210> 51 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 
<223> 



Reverse primer for PCR amplification of NGA280 SSLP marker in 
Arabidopsis thaliana subspecies 



<400> 5i 
ggctccataa aaagtgcacc 

<210> 52 

<211> 21 

<212> DNA 

<213> Artificial sequence 



20 



<220> 
<223> 



Forward primer for PCR amplification of NGAlli SSLP marker in 
Arabidopsis thaliana subspecies 



<400> 52 
ctccagttgg aagctaaagg g 



<210> 53 

<211> 21 

<212> DNA 

<213> Artificial sequence 



21 
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v 220 : 

2 2 3 : 



Reverse primer for PCR ampl if icat ion of NGAlll SSLP marker in 
Arabidopsis thaliana subspecies 



<400> 53 

tgttttttag gacaaatggc g 21 



<210> 


54 


<211> 


20 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Forward primer for PCR amplification 




Arabidopsis thaliana subspecies 


<400> 


54 


ccctcacatc 


caaaacccac 



20 



<210> 55 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA168 SSLP marker in 
Arabidopsis thaliana subspecies 



<400> 


55 




gcacataccc 


acaaccagaa 


20 


<210> 
<211> 
<212> 
<213> 


56 
20 
DNA 

Artificial sequence 




<220> 
<223> 


Forward primer for PCR amplification of NGA1126 
in Arabidopsis thaliana subspecies 


SSLP marker 


<400> 


56 




cgctacgctt 


ttcggtaaag 


20 
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<210> 57 

<211> 20 

<212> DNA 

<-l3> Artificial sequence 



<220> 
<223> 



Reverse primer for PCR amplif icacion of NGA1126 sstp 

m Arabidopsis chaliana subspecies NGA1126 SSL* marker 



<400> 57 
gcacagccca agccacaacc 

<210> 58 

<211> 20 

<212> DNA 

<213> Artificial sequence 



20 



<220> 
<223> 



<400> 5Q 
aaagagatga gaatttggac 

<210> 59 

<211> 23 

<212> DNA 

<213> Artificial sequence 



20 



<220> 
<223> 



<400> 59 
acatatcaat atattaaagt age 

<210> so 

<211> 18 

<212> DNA 

<213> Artificial sequence 



23 



<220> 
<223> 



Ir^opsrs^Lf 0 " r PlifiCaCi0n ° f NGA16S SSLP «rl»r in 

Araoiaopsis thaliana subspecies 



<400> so 



9919492A2J_> 



WO 99/19492 



PCT/EP98/06977 



40 

:cgcccaccg cactgccg 18 



<210> 61 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA168 SSLP marker in 
Arabidopsis chaliana subspecies 



<400> 


61 




gaggacatgt 


ataggagcct eg 


22 


<210> 
<211> 
<212> 
<213> 


62 
20 
DNA 

Artificial sequence 


m 


<220> 
<223> 


Forward primer for PCR amplification of AthBIQ2 
in Arabidopsis chaliana subspecies 


SSL? marker 


<400> 


62 




cgacctccLc 


ttccatggag 


20 


<210> 
<211> 
<212> 


63 
22 
DNA 





<213> Artificial sequence 

<220> 

<223> Reverse primer for PCR amplification of AthBI02 SSL? marker 

in Arabidopsis thaliana subspecies 

<400> 63 

ttaacagaaa cccaaagctt tc 2 2 



<210> 64 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of AthUBIQUE SSLP marker 
in Arabidopsis thaliana subspecies 
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<400> 64 
aggcaaacgt ccaccccatc g 

21 

<210> 65 
<211> 20 
<212> DNA 

<213> Artificial sequence 



<220> 
<223> 



in V I^K-S rimer PCR am P lifi "Cion of AthUBIQUE SSLP marker 

in Arabidopsis chaliana subspecies marker 



<400> 65 
acgacacggc agatttctcc 



<210> 66 

<211> 21 

<212> DNA 

<213> Artificial sequence 



20 



<220> 
<223> 



Forward primer for PCR amplification of NGA172 SSLP marker in 
Arabidopsis chaliana subspecies er ln 



<400> 66 
agctgccccc ttatagcgtc c 

<210> 67 

<211> X9 

<212> DNA 

<213> Artificial sequence 



21 



<220> 
<223> 



22^2" S f or PCR r plificacion of NGA172 SSLP ^ 

Arabidopsis thaliana subspecies 



<400> 67 
catccgaatg ccattgttc 



<210> 68 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 



19 
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<223> 


Forward primer for PCR ampl i f icac ion 
Arabidopsis chaliana subspecies 


of 


NGAl 2 6 


SSLP 




4 0 0 > 


63 










gaaaaaacgc 


cacctccgtg g 








21 


<210> 
<211> 
<2 12 > 
<213> 


69 
22 
DNA 

Artificial sequence 










<220> 
<223> 


Reverse primer for PCR amplification 
Arabidopsis chaliana subspecies 


of 


NGAl 2 6 


SSLP 


marker in 


<400> 


69 










caagagcaac 


atcaagagca gc 








22 


<210> 
<2I1> 
<212> 
<213> 


70 
20 
DNA 

Artificial sequence 










<220> 
<223> 


Forward primer for PCR amplif icacion 
Arabidopsis chaliana subspecies 


of 


NGAl 6 2 


SSLP 


marker in 


<400> 


70 










cacgcaaccc 


gcatccgagg 








20 


<210> 
<211> 

<213> 


7 1 
22 
DNA 

Artificial sequence 










<220> 
<223> 


Reverse primer for PCR amplif icacion 
Arabidopsis thaliana subspecies 


of 


NGAl 6 2 


SSLP 


marker in 


<400> 


71 










ccctgccact 


cttttcctct gg 








22 



<210> 72 

<211> 21 

<212> DNA 

<213> Artificial sequence 



IMSDOCIO <WO 9919492A2_I_> 



WO 99/19492 


PCT/EP98/06977 




43 




< 2 2 0 > 






<223> 


F51S tftaliana subspecies 


SSLP marker in 


<400> 


72 




tggatttct 


c cccctcttca c 


2: 


<210> 
<211> 
<212> 
<213 > 


73 
21 
DNA 

Artificial sequence 


<220> 






<223> 


Reverse primer for PGR amplification of NGA6 
Arabxdopsxs thaliana subspecies 


SSLP marker in 


<400> 


73 




atggagaagc 


tcacactgat c 


21 


<210> 
<211> 
<212> 
<213 > 


74 
20 
DNA 

Artificial sequence 




<220 > 






<223> 


Irlh?^ Prlmer f ° r PCR «*"«cacion of NGA12 
Arabxdopsis thaliana subspecies 


SSLP marker in 


<400> 


74 




aacgctgccc 


tcccctcctc 


20 


<210> 
<211> 
<212> 

^ <L -J ^> 


75 
22 
DNA 

Artificial sequence 




<220> 






<223> 





<400> 75 
tgatgctctc tgaaacaaga gc 



22 
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<210> 


76 




Z 1 


<212 > 


DNA 


<213> 


Arc i f icial sequence 


< 4. 4. V > 




<223> 


Forward primer for PGR amplif ication 




Arabidopsis thaliana subspecies 


<400> 


76 



gagggcaaac ctctatttcg g 21 

<210> 77 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplif icacion of NGA8 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 77 

cggctcccgc ctacaaacac cc 22 

<210> 78 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA1107 SSLP marker 
in Arabidopsis thaliana subspecies 

<400> 78 

gcgaaaaaac aaaaaaatcc a 21 

<210> 79 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA1107 SSLP marker 
in Arabidopsis thaliana subspecies 

<400> 79 
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cgacgaaccg acagaattag g 

<210> 80 

<211> 21 

<212> DNA 

<213> Artificial sequence 



21 



<220> 
<223> 



SM^SLTLSU"*-' ° £ NGA2 2 S SS, P m in 



<400> 80 
gaaatccaaa ccccagagag g 



<210> 81 

<211> 22 

<212> DNA 

<213> Artificial sequence 



21 



<220> 
<223> 



<400> si 
tctccccacc agttttgtgt cc 



<210> 82 

<211> 19 

<212> DNA 

<213> Artificial sequence 

<220> 
<223> 



22 



<400> 82 
taccgtcaat ttcatcgcc 



19 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 



83 
22 
DNA 

Artificial sequence 
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<400> 


83 




ggacccccaa 


ctgtaaaatc cc 


22 


<210> 
<211> 
<212> 
<213> 


84 
22 
DNA 

Art i f i c ia 1 sequence 




<220> 
<223> 


Forward primer for PCR amplification of CA72 
Arabidopsis thaliana subspecies 


SSLP marker in 


<400> 


84 




aatcccagta 


accaaacaca ca 


22 


<210> 
<211> 
<212> 
< 2 1 3 > 


85 
20 
DNA 

Artificial sequence 





<220> 

<223> Reverse primer for PCR amplification of CA7 2 SSLP marker: in 

Arabidopsis thaliana subspecies 

<400> 85 

cccagtctaa ccacgaccac 20 

<210> 86 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA151 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 86 

gttttgggaa gttttgctgg 20 

<210> 87 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 
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<223> IrIbidL« rime ! f° r PCR ampUficaClon of NGA ^l SSLP marker in 

Arabidopsxs chaliana subspecies 

-.400 > 87 
cagcctaaaa gcgagagtat gatq 

24 

<210> 88 
<211> 22 
<212> DNA 

<213> Artificial sequence 



<220> 
<223> 



Ir^doos^f ^Li 0r r PUfiCaCi0n ° f NGA10S SSLP in 
AraiDidopsis thalxana subspecies 



<400> 88 
gccatggagc ttctagggca eg 

<210> 89 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 



<223> Re verse primer for PCR amplification of NGA106 SSLP marker in 

Arabxdopsis thaliana subspecies 



<400> 89 
tgccccattt cgctcttctc 



<210> 90 

<211> 20. 

<212> DNA 

<213> Artificial sequence 



20 



<220> 
<223> 



Forward primer for PCR amplification of NGA139 SSLP marker in 
Arabidopsis thaliana subspecies 



<400> 90 
agagctacca gatccgatgg 



<210> 9i 

<211> 21 

<212> DNA 

<213> r Artificial sequence 



20 



5DOCIQ <WO 0010*09*^ I v 



WO 99/19492 



PCT/EP98/06977 



48 



<220> 






<223> 


Reverse primer for PCR amplif ication 
Arabidopsis chaliana subspecies 


of NGA13 9 SSLP marker in 


<400> 


91 




csctf r f r*rr t" f f~ 


caccacccag g 


21 


<210> 


92 




< ^ X X > 






<212> 


DNA 




<213> 


Art i f i c i a 1 sequence 




<220> 






<223> 


Forward primer for PCR amplification 
Arabidopsis thaliana subspecies 


of NGA76 SSLP marker in 


<400> 






ggagaaaatg 


tcacccccca cc 


m 

22 


<210> 


93 




<211 > 


20 




<212> 


DNA 




<213> 


Artificial sequence 




<220> 






<223> 


Reverse primer for PCR amplification 
Arabidopsis thaliana subspecies 


of NGA76 SSLP marker in 


<400> 


93 




c*yy<«-'CiL.yyycl 


yacacccacg 


20 


<210> 


94 




<2 1 1 > 


20 




<212> 


DNA 




<-tlJ > 


Artificial sequence 




<220> 






<223> 


Forward primer for PCR amplification of ATHS0191 SSLP marker 
in Arabidopsis thaliana subspecies 


<400> 


94 




ctccaccaat 


catgcaaatg 


20 
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<210> 95 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 
<223> 



Reverse primer for PCR amplification of ATHS0191 SSLP marker 
in Arabidopsis thaliana subspecies marker 



<400> 95 
tgatgttgat ggagatggtc a 



<210> 96 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 



21 



<223> ^°^ d P rimer for PCR amplification of NGA129 SSLP marker in 

Arabidopsis thaliana subspecies 



<400> 96 
tcaggaggaa ctaaagtgag gg 



22 



<210> 
<211> 
<212> 
<213> 


97 
22 
DNA 

Artificial sequence 




<220> 






<223> 


Reverse primer for PCR amplification of NGA129 
Arabidopsis thaliana subspecies 


SSLP marker in 


<400> 


97 




cacactgaag 


atggtcttga gg 


22 


<210> 
<211> 
<212> 
<213> 


98 

8062 
DNA 

Arabidopsis thaliana ecotype Columbia 




<220> 






<223> 


Genomic DNA sequence of AtMSH6 




<400> 


97 





ttttttggtt gctaacaatn aaggtatacg gttttatgtc atcaatataa ctatatatac 



60 
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W* V** %J Gfc Ca CJL w GL 


aoatat ar^r 

fc* c» a l a v, c» L> 


cgt tt 1 1 tea 


t t tatcaaac 


aaaacaacaa 


gactttcttt 


120 


tcactcttta 

W w u >— w w w w w <— » 


cat r acr t* raa 


cadaa c acaa 


gataaacgac 


ategt t taat 


catttcccaa 


180 


tctcacccct 


aaatttaaca 

W» W* www U wl W w\ 


v-uuayadCCC 


tccccacctc 


cgcaagcaca 


gectgattag 


240 


gaacagctt t 


acca 1 1 ctca 

W* W- W> *mM. w w w W w» 


tAt r PPfrra a 
UC *L. LLL Ly da 


ccacccgagt 


cctct cattg 


atctgtttcg 


300 


ccaaacccgc 


t tcxtcracatc 

W Wj w w* w> w\ w- w> 


^ *— U» V, LULid 


acctcgcccc 


ctgtatcatc 


aacctcacct 


360 


ctgct ttcac 


acoatccatc 


y Luy uayytL 


CtyLLuCCtC 


t tccagcttc 


ttcgtgttaa 


420 


w* W> W\ W- WJ WJ QU 


^3^^y cay a u 


llCCCCClCl 


tgt t cgaacc 


ggcatcgaat 


ctcttaaccg 


480 


1 1 tcraacccrc 


osrAfrrrr p p 
ya^a^Ly u L. L, 


CLCayayCCCf 


cgt taacege 


teteggateg 


egtaggtetc 


540 


crcrf* p f p r rra 


^ ^ ^ ^ rr 3 t* ^ /•» 


tggagaacta 


ctggttccca 


gccttgtgtc 


actgctcctg 


600 




yy tatcyLC 


gacyaatcga 


gagaaaggaa 


caacgegaaa 


attttattaa 


660 


p r* P n a. <-r r — ~ r 
l. l. t_ y c*y w w l. 


yaaac tgaga 


aacgacgaag 


a tgaagaacg 


ttgttgagag 


gattgtgata 


720 


tUldCdCdCd 


cgaagat tgg 


c ttctggaga 


acccgaccac 


cttt ttctcc 


attttegtet 


780 


l. uyyaaty l. w 


l, L. L. dydyd C y 


at tgacgacg 


tgtcattatc 


tgat ttgcag 


ttaaccaatg 


840 


etc r r r a crar 
l- l l l» w tyyy w 


Lyyattcy uy 


gcacaccaca 


ttacccgat t 


tggctcaatg 


gt tttatata 


900 


www ™J ™J www 


t racrr p rercrr 


t.y ag c cat 


cat taaaat t 


aagctaacca 


aaaattctcg 


960 


taaaac r tat 

w wm w\ u w% W w w w\ w 




a u tcy y accc 


ct tacttcca 


gaaccgaatt 


attcgaaacc 


1020 


croaatcaacc 




CLddLyCCCy 


attgacccgc 


tggctagaaa 


gatccaaegg 


1080 


tatacaataa 


t aoaaca t a a 


a u> v»y y ct \_y y c 


cat caaagee 


t caaagagtg 


aacagtcaac 


1140 


aaaaaaagtt 


craaccctaaa 


y ay La l uy L, L» 


P ^* ^ 3 ^ ^ ^ 

LCCyCCaL L C 


c tacgacgea 


aggegaaaa t 


1200 


ttttggcgcc 


aatctttccc 


LLLLL LL.ydd 


c t c tec cage 


t caaaacat c 


gtttctctct 


1260 


cactctctct 


cacaattcca 


aaaaa w y v,ay 


cgccagagac 


cgat tttgee 


t t tct tccaa 


1320 


aaacccacgg 


ctrcTctTact ac 


araacrcrcTt 1 1 rr 
y aa yyy LLLy 


y tc teeggeg 


atgetgetag 


eggeggggge 


1380 


QQcaacacraa 

33 3^"33**3 


accaroa p p p 


aaLy Lyaayy 


aaggggatgc 


taaaggegae 


gcttctgtac 


1440 


gttttgctgt 


ttcgaaatct 


gtcgatgagg 


ttagaggaac 


ggatactcca 


ceggagaagg 


15O0 


ttccgcgtcg 


tgtcctgccg 


tctggattta 


ageeggctga 


atccgccggt 


gatgettegt 


1560 


ccctgctctc 


caacattatg 


cataagcttg 


taaaagtcga 


tgatcgagat 


tgttctggag 


1620 


agaggtacta 


atcttcgatt 


ctctcaactt 


tgttatcttt 


agctggaaga 


agaagattcg 


1680 
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cgcaacccgc 


cigcactcgcc 


ggagagac cc 


t era t" r A<*»r rr/-» 


attggatcgt 


tgcttacaaa 


1740 


crcccaggag 


ccgagaagat 


gtcgtcccgc 


tcraacciat* rr» 


atctccatgt 


aegaaggeca 


1800 


acgatgttat 


ccctcaacct 


egttccaaca 




tcaagaaaga 


aaccatgccc 


1860 


ttagtttcag 


cgggagagct 


gaactcagat 




tataggagta 


gatggcgacg 


1920 


ctcctggccc 


agaaacacca 


9j go; a t area c c 




tegctcgaag 


cgagttctgg 


1980 


aggacgaaac 


gacctccaag 


gaggataagg 




ggactccaac 


aaaaggctga 


2040 


aaatgcccca 


ggacccggtc 


tgeggagaga 


3 Of A a 24 /"T 3 

ayaddyddgt 


aaacgaagga 


accaaatctg 


2100 


aatggcttga 


gccttctcga 


atcagggacg 


ccaatagaag 


aegtcctgat 


gatccccttt 


2160 


acgatagaaa 


gaccttacac 


ataccacctg 


atgttttcaa 


gaaaatgtct 


gcatcacaaa 


2220 


agcaataccg 


gagtgctaag 


agegaacaca 


tggacattgt 


gcttcccttt 


aaagtggtta 


2280 



ccacgcc.cg ccccccccag ggg.aacccc acgagccgc, cg,g«agac gcggaattag 
gecacaagga gcccgaccgg aagacgacc. cgagcgg„ t ggg aaa acgc a 9 a=. 9gt aa 
acca g ccga. acaaccggcc tgetcga.ee actgtgtct, t..,ttctg. cccaccttt 
cgccctaggt tggtatctct 9 aaa 9 t 9gga tagatgaggc a g t g c.a,a g ctactagctc 
gegggtaagg 9 aac=accac accttatgga actcgettac tgctactccg gctaggatct 
a. 9 .a.c 99 a .atc.cttca agc.tc.tta gttaggatcc tgagaactca gg.tgttctc 
ttattegcta tataacaagt cctttc.ee. aggagcaac. aac M aa=tt gc.ct.ee 
tgtgtgccca ccggcaaggc .cacacaccc agecacecc cgcc.geeca ccgc.gc.ae 
agccacggac aat.tatgcc cactcgtacg cggtaccctc accccgeccc teaeggaggc 
ccccaageec cgcgccg... ccggacagcc aeacacgett ccaacgaaa et. g catgca 
gacccatatg cccecccacc ccaecaacea cgcacegaca eaceegtege cecccccg.. 
agacacaaag cc gg acgaac cgagcagcca gaaacacecg acoaagcaaa agecagagge 
gccaacaccg cagcctccc cgg.caggcc aaggagageg ctgcagaetg tccccgacca 
cccctctcce cgcacaccac cttcacgccg ccta.ccc ..tggecate ccggcceg.c 
cacc.ga.aa ccccaaggaa gecagcccag gc.ec.acce cace.acagc aagcgaggga 
aacaccgggc ccg.cgecgc ccaccccccc gecacaaaag ..jgctcgec. 
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cacct caeca 


text tcacrt ec 


day L. <w 


tgaaaaat t a 


cactcttctt 


taccaatctt 


3360 
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yddddtCdCC 
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3420 
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cacgcaaacc actggccagt acccccacaa acecccagac ttagaaagac cgctcggacg 
caccaagcct agcgttcgac catcagcctc tgtgccgccc gctcttctgg ggaaaaaagc 
gctgaaacaa cgagtaagta tcaatcacaa gctttctgag caacgccccc cacgagtagt 
acaggactaa aacattacgg gtctagccaa agactgttct ccccctctcg caatgtctgg 
tcatccatta cacttctctt aacttatcgc atcgcaggct aaagcatttg ggcaaactgt 
gaaagggttc agaagtggaa ttgatctgtc gtcggctcta cagaaggaat caaatacgat 
gagcccgctc tacaaactct gtaaactccc tacactagca ggaaaaagcg ggccagagcc 
atttctttcc caattcgaag cagccataga tagcgacccc ccaaattatc aggcgcccat 
ctacccccca Cacttcacaa caaaacgtct gtrcactactc aaagcaatgc atacggctta 
gacctcaacc cacaccccga ggaccccaaa gggacctgcc tcttaetcct: aacgtccttg 
gacggtctga cttatttcca actcgaactc attaatctcg caccagaacc aagacgcgac 
agatgaaaac gccgaaaccc ccacaatacc tatcgaaccc cccaccgaaa gagcaaccca 
acggcccgag gtcacccaca ccataagctg cctagatgcc ccgagatctt ccgcaaccgc 
agcaagtctc tctgctggaa gcatggccag gcctgttatt tttcccgaac cagaagccac 
agatcagaac cagaaaacaa aagggccaat acctaaaacc caaggaccac ggcacccact 
cgcagccgca gccgatggcc aaccgcccgc tccgaatgac atactccttg gcgaggctag 
aagaagcagt ggcagcaccc atcctcggtc atcgttaccg acgggaccaa acatgggcgg 
aaaatcaacc cctcttcgtg caacacgtct ggccgttacc tttgcccaag tctgtatact 6000 
cgttagacaa ttactctact ctetgcaatc agttcctcaa catgaacaac aaatcctgtt 
ttctgtccgc agcttggctg ccacgtgccg tgtgagtctt gcgaaatctc cctcgtggat 
accaccccca caaggcctgg cgcatccgac agaaccatga caggagagag taagctttgt 6180 
cctcaaaaca ccaattcctc gaactaccta ctcagatttt gtctgaccgg acaaggtggt 6240 
cccgccttct tccaggcacc tCCttggtag aacgcaccga gacagcgtca gttcttcaga 630O 
acgcaactca ggattcacta gtaatccttg acgaactggg cagaggaacc agcaccctcg 6360 
acggacacgc cattgcacac tcggcaaccc gctcctctcc tccaactut acttgtcgac 6420 
caacaaaaac atgcaatcca ttttgccgaa acttactgat ctacaccagg tttttcgtca 6480 
cccggtagag aaagcccaat gtcggatgct ctttgcaaca cattaccacc ctctcaccaa 
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ggaatccgcg tctcacccac gtgccacctc gaaacacatg gcttgcgcac ccaaatcaag 6600 

acccgattac caaccacgtg gccgtgatca agacccagcg ctcccgcacc gtccaaccga 6660 

gggagctcgt cctgagagct acggacctca agtggcactc acggctggaa taccaaacca 6720 

agtggttgaa acagcatcag gtgctgccca agccatgaag agatcaactg gggaaaactc 6780 

caagtcaagc gagctaagat ccgagttctc aagtctgcat gaagactggc tcaagtcatt 6840 

ggtgggcatt tctcgagtcg cccacaacaa tgcccccact ggcgaagatg actacgacac 6 900 

tttgtcttgc ttatggcatg agaccaaatc ctcttactgt gttcccaaat aaatggctat 6 960 

gacacaacac tatctgaagc tcgctaagtc ttttgcttct ctgatgttta ttcctcttaa 7020 

aaaatgccta tatatcaaaa aattgtttcc tcgattataa caagattata tacgtatctg 7080 

ccggtttagc tatggtatat aatatatgta tgttcatgag actggccaag agaaataccc 7140 

m 

acaaacagca catcaagaag gaaacacgct tacgcactaa cccaagtttc aagacaaact 7200 

gcaaataacc tcgaccaaag ttgcaaagac caaacacaaa ccacaaaact tacaagactc 7260 

aagctccgaa ctccccaaaa ccaaaaaaaa aaacagaaca catcttgctg catctacaaa 7320 

caacacaaac ctacacagct tataacccac ccaccactga gaccaacatc agaatcattc 7380 

tccatttctt catcttcacc ctcatcacca ccaccaccac catgacgatc ccccccctcc 7440 

ccacgcaacc tagcaatctc accctgagct ctaccaacaa cctgcttctc ccgcaactcc 7500 

aaacctctct gaaaaccagc ccccacctcc tccaactcct ccacccgctc ttccctaccc 7560 

ccctccatcc tctcataaac cctcccaaac ccctcaacag aatccgccaa catcctatac 762 0 

gaagcagcgt cattaacctt ccccctcccg tactcaacct catcatcctc accctcctcc 7680 

tcttcagaat caccaggact atccaccatc ccaccaaacc caccagactt atctaaataa 774 0 

accttagtgt tcataaacac aaactcacct gaatcaacac cacaagccaa acctaaatcc 7800 

gacttgggcg aaacacaaag caacacatcc aacttattga aaaacgacca tttacttgaa 7860 

cctaaacctg actcctcaac cctaaccttc tcctttccat actccctctt caagtcatca 7920 

atcactctcc tacattgcgc ctcagatctc tccatcctta gctccccact cactccctca 7980 

gctacttcat tccaatcctc gctcctcaaa ctccttctac ccaatcgcaa aaacccatct 8040 

ccccaaactt caagcaacac aa 80 6 2 
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Figure 3 

Comm»nta/R*ferBnca*: 13 =; &' side o1 SS (AtM3H3) 2104bp in pUCie/Smal 
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TTTTTTCOTTCCTW.CAATAAAGG7i\TACGGTTTTATC5TCATCA^TATAA 
CTATATATAAAA&VkAfGAAAGATATAT^^^^ 

ftf^JkC AftCAAEACT TTTT1TTTACT TT TT AC A7T5STC AACAAAAT AGAA 

CATA.^.CGACATCC7TTAATCATTTCCCAATTTTACCCC7AAGTTTftACA 

CCTA<jAACCTTCTCCATCTTCGCAAGCACASCCTDA^ACCAACAGCTTT 

ACCATTCTCATAtTCCTGAACTACCTGA^TCCTCTCATTEATCTGTTTCG* 

tCAAATCCGCTTCTGACATCTTCTTCTCCAATCTCCCTlTCTGTATCATC 

AACCTCACCTCTGCTTTCACACGATCCATCGCCGCAGGCT<ITGTTTCT7C 

TTCCAGCTTCTTCGTCTTAATCACCCCAACCGCCGTAGArTTCCCCTTTT 

TGTTCGAACCCGCATCBAAT7TC^AACCGT7TQAACCGCMCACCGTrT 

CTCAGAGCTGCGTTAACCGCTTTCGGAtCGCGTAGGTCTTGGCTCTTTTG 

TTTTGATTTCKSGAGAACTACTGDTtCCCAGTCTTGTGTTACTDCtCCTG 

GGTATCTGCTCGDCATCGTCGATGAATTGAGAGAAAGGAACAACGCGAAA 

ATTTTATtAATCfGAGTTTTGAAATTGAGAAACGATGAAiGATGAAGAATG 

. rT GTTGAGAGGATTB«A7ATt7A7ATATACGAAGATTGGrKCTGGA6A 

WT CCA7CATCrr7TTCTCCATTTTCCTCTCTGGAACGTtCTTAGAGATG 

ATTGACGACGTG7CA7TATC7GA?7TGCAGT7AADGAATGCTTW - TQBGT 

TGGATtCGTGGTAfACCATATTATCC^ATTTGGCTCAATGGTTTTATATA 

AA TTTGG7T-?7CGGTTCGGTTATGAr.T7A7CATTAAAATTAAGCtAACCA 

^AAA777TCG7AAV.TTTAT7TCGG7TrCAATTCCGATC2CfTACTTCCA 

GAACCOAAT7A7TCG.WCCGGGG7TA^CGAACCGAATACCAATGCCTG 

AtTEACrCGTTGGCTAGAAAGAtCCAACGCTATACAATAATACAACATAA 

MCGGACGG7CA7CAAAGCCTCAAAGAG7GAACAGTCAACAAAAWAGTT 

gagccctgaggagtatcgtttcgcccatttctaceacgcaagocgaaaat 

aT TTGGCGCCAATCTT7CCCCCCT77C<^TTClCTCAGCTCAAAACATC 
QTTTCTCTCTCACTC7C7C7CACAA77CCAAAAAATGCAGCDCM^AGAr 
CGAT7T7CTCTT7C7TCCAAAAACCCACGGCGGCGAC7?^GAAGGG7tTG 
GT7TCCGGCGATCCTGC7AGCSGCGGGCCr5GCAGCGSAGACCACGATTT 
AATGTGAAGGAA€GGGATGCTAAAGGC<SACGCT7CTGTACG7T7TCCTG? 
T T CGAAA7CTGTCGA7GAG«7TAGAGGAACGGATACTCCACCGGAGAAGG 
TTCCGCGTCGTGTCC7GGCGTCTCGAT1TAAGCCGGCTGAATCCGCCGG7 
GATGCTTCGTCCCTGTTCTCCAATATTATGCATAAGriTCTAAAAGTCGA 
7WTCGAWTTGT7CrGGAGAGAGG?ACTAATCT7CGAtTC7CTTAATTT 
TG7TATCTrrAGCTGGAACAAGAAGATTCGTGTAATTTGTTG7ATTCGTT 
QGAGA^TTCTGAriACTGCAT7G<»TCGTTG77TACAAArrrTCAGGAC 
CCGAGAAGATGTrGTTCCGCTGAATGATTCATCTCTATGTAtGAAGGCTA 
ATCA7GTTATTCCTCAA7TTOG7TCCAATAA7GGTAAAACTCAAGAAAGA 
AACCAtGCTTTTAGTTTCAGTGGGAGAGCTGAACTTAGATCAGTACAAGA 
TATADGAGTAGATGGC5GA7GTTCCTGGTCCA^AAACAOCAGGGATGCGTC 
-CACG7GCTTCTCGCTTGAAGCGAGTTCTGGAG6ATGAAATGACTT7TAAG 

* gagi^taaggttcctctat^ggactctaacaaaaggctgaaaatgctcca 

GCAf CCGGTTTGtGGAGAGAAGAAAGAAGTAA^^ 

,KATGGCTTGAG7C77C7CGAATC^GGGA7GCC^ATAGAAGAGG7CCTGAT 
GA7CCCCTTTACGATWAJ^ACC7TACACATACCACCTl^tG7^TCAA 

Figure 11 
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GAAAATGTCTGC3i7£ACAAAAGCAATATTGG^TGTTAAGAGTGAATATA 2250 

TGG ACATTG7 5CTTTTCTT7AAftGTGGTTAG1' AAC7ATTAATC7AGTGTT 2 300 

CAA7CCATTTCC7CAATGTGATTTD77CAC77ACATCTG77TACGT7ATG 2350 

CTCtTCTCAGCSGGAAATTTTATGAGCTGTATGATCTACATOWCAAtTAG 2400 

GTC AC AAGC^GCTTGACTCGAAGA7<VlOCATGAGTCGTG7GGGAAAATGC 20 50 

AGACAGGTAAAT7A£?TGAAfcCAACTGGCCTGCTTGAA7TA7TGT5TCTA 2500 

TAAATTTTC WACCACCTTTtGTTTCAGGTTCG7ATCTCTGAAAGTGGGA 2 550 

TAGATCAGGCAGT^CAAAAGCTATTAGCTCCTGGGTAAGGGAACCATCAT 2600 

AC7TTATGGAATTCGT1^ACTGCTACTTCGGCTAGGATTTAAGAAATGGA 2650 

AATC ACTTCAAGC ATCATTA&TTAGGA7CCTG AG AACTCAGG ATGTTTTC 27 0 0 

TT A7TCGTTATATAA7Ai«TCtTTTCATCAACX3AGTAACAAACAAAACTT 2750 

C<»CAATAT7TGTCTGCTCACTGGCAAGGCATATATACCCAGCTAACCTT 2S00 

TGCT AGTTCACT57 AGTAACAGTTACGGATAATAt ATGTTTACTT^TATG 2950 

TGG7ACCCTCATTTTG7C7 CTCATGGAGGCTTTGAAGCCTTGTGT7GAAA 2900 

CTGGAT ACTTAC ATATGCTTCCAACAGAAACTAGC ATGCW3ATTC ATATG 2950 

CTTTCCTATTCTACTAATTATG7ArTCACACAC7CGTTCTTTCTrTTCAA 3000 

AGATATAAAGTTGGACCAA7 CCAGCAGC7A<;AAAC ATCTGACCAACCAAA 3050 * 

AGCC ACAGOTGiT; AATAC TOT AAG7r7TCTT3C ATAG^TCAAGGAGAGTC 3100 

TTGCAGACTCT77T7UA7C?fc77TC77777C7GTACA7TACT77CATGCTG 3150 

7AATT AACTCAA7GCC7A7T C7GG7 CT ^ A7T ATCAGA7 AAtTCCAAGGAA 3 20 0 

GCTAG7TCAGG7A77AACTCCA7C^CAGCAAGC<SACCnAAAC«7CGCGC 3 2 S 0 

CTCATGOCG7CCATt:77C77GCTA7AAAftGAGG7775T7ATTTAC77ATT 3300 

TATCT7A7CA7GTTC AGTTC ATCCAAGf CCTGAAAAA77ACAC7 CTTCTT 3350 

T ACC AATCTTCCATCAAGC7GTGT A AAGGAT77GG^AT7AGAAAA7C ATT 3 4 0 0 

A7TTGATCC7TTGTT7TATATCr7iAGAG^TTCCCTTGAAAAGATCTGTTT 3 4 50 

AAG A77C777GCAC77GAAAAA77C AATCTTTITAA&7 CAATC ZZZTACT 3500 

TTC7TACAATGA7CA1AG7CTGCAA7TGCATGTCAAGTAATA7CAT7CCT 3550 

7GT7AC70CATCCCCC7CTrXCT7AATGACCATTG7CTA7 GT7G7 STTTG 3 bO 0 

TCTCCTGTGCTG6AGAAAATGATAGCTGATCCAASCTC7ACATTA1 , CAT6 3650 

ATT AAGTAGCTGCTCAG5AATTGCCTTTGGTTACATTGCCTAATGG7 77G 3*? 00 

ATGTCAATT7TTCTTCTGAATC777ATTT7AGATCAAARTGGAGCTACAA 37 5 0 

AAG-TGTTCAACTGTG7ATGGATTTGCTTTTCT7C5ACTG7CC7GCCTTGAG 3800 

GTTTTGGGTTGGCTCCATCAGCGA7CA7CCATCATG7CC7GCTCT^I^]AG 33 50 

CGTTATT k GATCCA<^TAAGCAAGTGTAT7CTGTA7C77ATGTG7ACCATG 3 90 0 

TGACTTCCTGTGCATA7ATTTGGGTTGCAGGAACTAATTCTGAATCAOCA *930 

7TTGGT AT6TTTTTTCCAGMT7C7CCAAAGGAAGTGTTATATGACAGTA * 000 

AAGGTAAACTBC77«TATCGCCAGTTGttTTG77AAACAGAATTTAAGGT * 0 50 

AAATGACACTGGTTAATTTAAAGTGCAT ACATGTTGAAA VATTGCAGGGC ^00 

TATCAJ^GAAGCACAAAAGGCTCTA^^ * 150 

ATT7CAGTAGGCAAGC7AACTGACAATTTAACCGCTCAGCGAATGATAGG 4200 

7CTCTT AAACATTGCTAAT€7AGATG A7GTTT ATGTTTCAATCTAATACG * 2 5 0 
GTCTACGGCCGTACA<?TTGGCTCCAGTACCACAAGTAA7GG&GGATACAG 

ATC5CTGCTGGAGTTA«5AAATATAATACAATCTAACGGATACTTTAAAOGrr * 3 50 

TCTTC7C^7CATGG^iACTGTGCTGTTGATCGTCTAAATGAATGTGATGT 4400 
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TGCCCTTMTGCtCTTCGA«DDTAATTAATCATCTGTCTACGCTXfcMW 
TCrrcTTGCCTTGTTTAGTTTTTGCTTT^ 

ttcrtaacttacaotttctatctacttgcagctagaagatgtacttaagc 
a7<sggcatatttttccatj«:caagftta(cagceg1tgtt7cagaatt€at 
ggccagacgatggtaaj^tcttgagatatttaacaatagctctgatggtgg 
7ccttcaggcaactgcatatttcttttt7gataacttcaactagjwg3ca 

GACATAGAAGGAAAAAtTCTAATACTTCGTACGGATCTCCAGTAAGTAAT 
AGCCGATTTTTCTTTACCTATGTAGGGAOCTTCTACAAATATC7TGATAA 
CTGTGTTAGTCCAACTCGTAAGCGACTCTTAAG6AATTGGATCTGCCATC 

C ACTCAAAG AT GTA GAAAGCA T C AAT AAAC GGCT T CATCT A GTTG AAG AA 

77CMGGCAAACTCAGAAAGTWGCAAATCACTGGCCAGTATCTCCACAA 

ACTTCCAGACTlAGAAAGACTDCTCGGACGCAtCAAGTCTAGCCTTCGAt 

CATCAGCCTCTGTGTtGCCTGCTClTCTGGGGA^AAAAGTGCTGAAACAA 

CGACTAAGTATCJVATCACAAGTTTTCTGAGTAATGCC7TCCATGAGTAC5T 

ATAGGACTAAAACATTACGGGTCTACCIAAAGACTCTTC7CCTTC77TTG 

CAATG7CTGGTTAT7CATTACATTTCTCTTAACTTA7TGCATTGCAGG7T 

A AAGCATTTGGGCAAA7TGTGAAAGGG77 CAGAAGTGGAATTGATC7 GTT 

G7TGGCTCTACACAAGGAA7CAAATATGATGACTTTGCT77ATAAACTCT 

GTAAAC7TC'jr;.TATT WTAGGA/.AAAGCG«l^7AGAG7 1 ATT7 CTT7CT 

CAATTCGAAGCAGCCATAGATAGCGACTTTCCAAATTATCAGG7CCCCA7 

C7ATCTTTCATACT7TACAACAAAATGTCTG7CAC7AC7 C AAAGCAA7 GC 

ATAXGGCTTAGA7CTCAACTCACACCCCGAGGATCCTAAAGGGATTTGCT 

T7TTATTCCTAATGTTTTTBGATCGTTTGATTTATTTCTAACTTGAACTT 

A7TAATCTTG7ACCAGAACCAA6ATGTSACAGA7GAAA>CGCTGAAACTC 

TCACAATACTTATCGAAC77TTTATCGAAAGAGCAACTCAA7CGTC7GAS 

GTCA7 7C AC ACC A7AAGCTGCC7AG A7GTOCTC A^ATCr 77TGCAA7 CCC 

AGCAAGTC7CTC7GCTGDAAGCA7DBtCACGCCTGTTAT777TCCCGAAT 

CAGAACC7ACAGA7CAGAATCAGAAAACAAAAGGGCCAATACT7AAAATC 

CAAGGACTAT6GCATCCATTTGCAG77GGAGCCGATGGTCAATTGCCTGT 

7CCGAATGATATACTCCT76GCGAGGCTAGAAGAAGCAGTGGCAGCA7TC 

A7CrrCGCTCA77GT7ACTGACBGGACCAAACATGGGCGGAAAATCAACT 

C7TCTTCGTGCAACATGTC7GGCCGTTATC77TGCCCAACTT7GTATACT 

mttagataattactctattcwtgcaatcagttcttcaacatgaaxaat 

PAATTCTGT7TXCTGTC7GCAGCTTGDC7GCTACGTGCCG7GTGAGTCT7 
GCGAAArCTCCC70GTGGATACTA7CXTCACAAiGGCTTCGCGCATC7GAT 

AGAATCATGACAjG^GAGAGTA^ 

GAACTATT7ACTCAGATlT7GTCTGATTGGACAAGGTGGTtTTCC7777T 
77TAGGTACCTTTTTGGTACAATGCACTGAGACAGCGTCAGTTCTTCAGA 
ATGCAACTCAIWATTCACTAGTAATCCTTGACGAACTGGGCAGAGGAAC7 
AGTAC77TCGATGGATACGK^TTGCKTACTCG<?TAACC7GCTC7TCTCC 
TTCAACTTATAC7TGTTGATC^ACAAAAACATGCAAtTCATTTtGCTGAA 
ACTTATT^TT7ATATCAGG?TTTTCGTCACCTGOTAGABAAAGTTCAAT 
C7OT^TCCTC77TGCAt^ACAT7ACCACCCTCTCACCAAM 
TCTCACCCACKTGTCACCTCGAAACACATGGC7TGCGCATTCAAATCAAG 

Figaro 1L (CdotiAued) 
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ATCTGATTATCAACCACGTGGTTCT<3ATCAA6nCCTACTGTTCTTSTACC 

GTTTAACCGAGGGAGCTTG7CCTGAGAGCTAC0CACTTCAAGTGGCACTC 

ATGGCTGGAATACCAAACCAAGTGGTTGftAACAGCATCAGGTGCTGCTCA 

AGCCAT<3AACAGATCAATTGGGGAAAACTTCAAGTCAA<3?GAOCTAAGAT 

CTOAGT^CTCAAGTCTGCATGAAGACTGGCTCAAGTCATTSGTGCGTATT 

TCTCGAGTCOCCCACAACAATGCCCCCAtTGGCSAAGATGACTACGACAC 

T-TTGTTTTGCTTATGGCATCAGATCAAATCCTCTTACTGTGTTCCCAAAT 

AAATGGCTATGACATAACACTATCTGAAGCTCGTTAAG7CTTTTGCTTCT 

CTGATGTTTATTCCTC^TAAAAAATGCTTATATATCAAAAAATtCTTrCC 

TCGATTATAACAAGATTATATATGTATCTGTCGGTT7AGCTATGGTATAT 

AA?ATATGTATGTTCATGAGATTGGTCAAGAGAAATACTCACaAACAGTA 

TATTAAGAAGGAAATATGTTtATGCATTAATTTAAGTTTCAAGATAAACT 

GCAAATAACCTCGACTAMCTTGCAAAGACCAAACACAAATTACAAAACT 

TATAA6ACTTAAG»rCTGAATTCCCTAAAA<XAAAAAAAAAAACAGAACA 

TATTTTCTTGCATCTACAAACAACACAAACCTACATAGTTTATAACTTAC 

TCATCACTGAGATTAACATCAGAATCATTCTCCATTTCTTCftTCTTCACT 

CTCATCATCATCACCAOCACCATCATGAT-rCtCCTCCTCTTCACGTAACC 

TAGCAATCTCACTCTGAGCTCTATCAACAATCTGCTTCT7CTGCAACTCC 

AAATCTCTCTGAAAA7CAGCTCTCATCTTCTCCAACTCCTTCATTTGCTC 

TTTCTTACTCTrCTCCATCTTCICATAAACCTTCCCAAACCTCTCAACAG 

AATCCGCCAAC AT CTTAT ACGAAGCAGC 5TCATTAACCTTCTTCCTCTCG 

TACTC AACCTCAT C ATCCTCATCCTCCTCCT CTTCAG AATCACCAGGACT 

ATCCATCATCTCATCAAACCCATTAGACTTATCTAAATAAACCTTAGTCT 

TCAIAAACACAAACTCACCTGAATCAACACCACAAGCTAAACCTAAATCC 

GACTTGGG(^AAACACAAAGCAACATATCCAaCrTATTG.^AAAACGACCA 

TTTACTTGAACCTAAACCTGATTTCTCAACCTTAATCTTCTCTTTTCTAT 

ACTTCCTCTTCAAGTCRTCAATCATTCTCCTACATTGCGTCTCAGATTTC 

TCCATCCTTACCTCCTCACTCACTTTCTCAGCTACTTCATTCCAATCCTC 

BTTCCTCAAACTCCTTCTACCCAATTGCAAAAACCTATCTCCCCAAACTT 
CAAGCAACACAA 

ffigur* 11 (Continued) 
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Figure 12 
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ComirH>nU/Ref«wiD0a: AtMSHe 37AlMSHa 3* antfesniSB : AtMSKfi (SB) 3* side (62-l379bp1 
&aM/S*M/T4 Into pPFl3 (pCWW (Sfi> 3' *id* (13-2104> MdwniySmftL in 

LBM404 
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Figure 15 
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pCWtMVSmat. <n LBA4404 
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Figure 16 



Comm«nt6m*f«rdneo9: AtMSHi (SB> complWi. orientation : pPF2B (334abp) 

Bmat Into pCMM64 Smai 
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SEQUENCE LI St IMG 



Oio^ Rhone- Pouienc A£ro; Be^sner f Andreas Stefan; Doutriaux. 

Marie- PBBcala; Freyssinet, Georgea ; Perez, P&scual . 

<120:> Methods £or cbtAintng plant variBtias 

<130> 39S498C 

<150> P09745 

*L51> 1997-10-10 

<160> 93 

<210> i 

<212> DNA 

•c213> Artificial B*>quAttc* 

<220> * 

.=.221 > inCKSif i Ad_hAfift 

<222> 11 

;223> I 

<220> 

c 22 1 > modi £ i etf^ba se 

<2-22> 14 

<22Z> I 

<22Q> 

c 2 2 1 > rnodif ied_baae 

<222> 17 

<223> I 

<220> 

<222> Deganerat« oligonucleotides Ut>MU used CO isolate. ACMSH3 and 
c$00> 

<30l> Ra^nan and Kolodner 

O02> Genet ica 

<303> 132 

<306> 953-973 

O07> 1992 

<400> i 

ctggatccac nggnccnaay atg 23 



<210> 2 
*211> 23 
*2l2> DNA 
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-,213 > Artificial sequence 

<220> 

<22l> modi f ied_base 

<222> 15 

<:22 *> I 

<220> 

<22i> rnodif ied^Jb&ae 

<222> 18 

<223> I 



<220> 

*2"> Degenerate ol igonucl eotidee domd used to isolate AtMSH3 and 

AtMSite. 

<300> 

<302> 
O03> 

O07> 

<4O0> 2 
ctggatccrt: artga^cnrc raa 



R&enan and Kolodner 

Generics 

132 

1992 



<2i0> 3 

<211> 24 

<212> DMA 

<213> Artificial sequence 

-:22D> 

<223> Bpacific primer 6 36 for PCR using cDNA o£ Arabldopeis 

t ha Li ana ecotype Columbia 

<400> 3 

tgctagtgcc tcttgcaagc tcAt 24 

<210> 4 
<211> 27 
<212> DNA 

<2l3> Artificial sequence 

<220> 

<222> Priirer aPi for pcjt using cDMA of Arabidopsis thaliana ecotype 

Columbia containing Ad&pcer sequences ligated to both its 
enda 

<400> 4 



3DOCID: <WO_ 9919492A2TI > 



WO 94*2 PCT/EP98/A6977 



ccatcctaac ac^actcact atagggc 27 

<211> 13 
-c2l2> DMA 

<213> Artificial sequence 

<220> 

<223> Primer AP2 for PC* using cDMA of Arabidopsie tha liana ecotype 

Columbia containing adapter sequences ligated to both its 
ends 

<4D0> 5 

actcactata gggctcgagc ggc 23 



<3i0> 6 

<21i> 30 

*212> DWA * 

<2i3;> Artificial sequence 

<220> 

<22l> MSH3 specific primer S525 Eor PCF ueing cDNA of Ar*bidop3is 
tftaliana ecotype Columbia 

<400> 6 



aggttctgAl! ctxcgcgtgac gcttcactta 3Q 



<210> 7 

<2tl;> 2$ 

<212> OKA 

<213> Artificial sequence 
<220> 

<223> specific primer S51 for PCR using cdiOA of Arabitfopaia 

thaliana ecotype Columbia 

7 



ggatcgggta ctgggttttg agtgtgagg 29 



<210> 8 

<2ll> 24 

<212> SNA 

<213> Artificial aequeitc* 
<220> 

<223> MSH3 spaciiic primer 63 5 for PCR using CDNA of Arabidopaie 
th3li^n«k ecotype Columbia 
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4 



gcacgtgctt gatggrgttc tcac 

24 

<211> 28 
<2l2> DMA 

*21 3 > Artificial sequence 

<220> 

<400> 9 

tcagacagta tccagcatgg cagaagta 



2a 



<21Q* id 

<211> 33 

<21*> Artificial sequence 
<220> 

<400> io 

accccgggac gggcaagcaa aagcagcaga cga 

<2lO> n 
<2Xl> 21 
<2I2> 

Artificial Bequeace 

<220> 

gacaaagagc gaaatgaggc cccttgg 

27 

<210> \2 
<21\> i2s 0 
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B 

<:400> 12 





U *A W — ■ Lvf U. LA 




r- 1" 1- •p * 
V U U L V 


C'.tt eg cccc 




60 


t c cccga etc 


v*. ~p *.* w v ~H 


u u.y y U <ay U- V- 


iTa a f ^ ^ ■> 
y a -a L\.cLv*ai^ 




ga.agatatcc 


120 


gccaccgtat 


ccttctctcc 


tti"' caaoca t 

w w — ^» *t>* y %4 W 


a acre ht r I* r»t* 




cgc cgc cgcg 


1 BO 




aqcc t a aact 

W U ^& W 






c ag vacc cga 


tcccaattta 


24 0 


cacca&aaat 

^* ^»^V% 10* ^» 








»yyu a CaLy L 


ccccgaaacg 


3 00 


ccaccatcga 


<y ct a a a t a cac 


a c^a t toes a 




t» g y z*g ceo aci 


gage aag u a c 


3 50 


c c^fl 3 test ci-a 


1 1 1 t-rrat not" 

L l_ U l*. y a. L <- 


.•TP »~r 3 <~t t* ^ ni*rt 


I. AC dyu L aCa 


ga^ c Cv c egg 


agaagacgeg 


420 




<_<a i_y '— y T. L. 


M t L LLUL 


gcccau.ar.gg 


at cac2 at 1 1 


catgaeggeg 


480 


^5 1" rr^ o ^ ^ 

c^yuyLytLciCi 


till «. LC<JtlfcL 


g<i « k- r- 1- (7C a t 


9 v y ci ag fic 


cggtgaafcgc 


aggatacaag 


S4 0 






L'.JdddCt ^Co 


SJ — LJ LLB at^ L. 


nc:cacggcgc 


aaaccggacc 


&0D 




•w c y gg g ac l. 




t«it ac casict<j 


ccacgcttga 


ageggctgag 


650 


17a tat" a a n t pt 
yavaw <*«y ^y 




X yytys a^da 


yy l. llC y g t. C 


cacagagtaa 


ttt;cctggt t 


720 


f 0"T o t~ tot. r*»rr 




t aayL C- g^«*3 


a ca c- c aggc u 


gt gg t a c tga 


aatgagtttc 


730 




t-\.^£y Kj* ^~!» *— 


tenret tea ^ 


l, L. L. L. y '3 T _ 3 VJ 


y ^ g^* 3 g c g *- 


v vacg aagag 


d4u 


tit csa t -gat a 


W ^ 4k» U» k» %J U.VI 


a aatooat ^ a 




*- *m l. l. y v. L. to. 


y L CdCGdyCt 


Q i\ 


^7 «3 ^ ^ «" v 


c 1 cc-acre c 

W ™^ %^ 


tc 1 1 1 cacaa 


as c t oa aa 


•« -ji v k+ \* vy y ^ 


y y tp«i^w> <xuy v> u 




ggacctacct 


caaacgtccg 


agtggaacgt 


gcctcaccgg 


at tgc ttcag 


caaeggcaat 


1C20 


•gcagtagacg 


aggttatCtc 


attatgegaa 


aaaatcagcg 


caggcaactt 


agaagatgat 


1080 


a&agaaatga 


agctggagge 


tgctgaaaaa 


ggsatgtcLt 


gcttgacagt 


tcatacaact 


1140 


atgaacatgc 


cacatctgac 


tgctcaagcc 


ctcgccctaa 


cgttttgcca 


tctcaaacag 


1200 


fcttggatttg 


aaaggatcct 


ttaccaaggg 


gcctcacttc 


gctctttgtc 




12 50 



-210> 13 

<211> 34 

c212> PWA 

<213> Artificial aegudnc* 
c220> 

<223> M5H3 ap^cific priencr 2S5 for PCR using cDNA of Arabidopais 
thaliana ecotypa Columbia 



vlSDOClD: <WO 9919492A2TI > 



WQ99/IW92 

PCT/EP98/969TT 

6 



acccr^gycc aaaacgaaca agtcggcttfc agtc 

<210? \n 

<2H> 27 

<212> OWA 

<213> Artificial sequence 



34 



<220v 
<223> 



MSHi specific primer S5Z for PCR using CONA of Arabidopaia 
cna liana ecotype Columbia 



<4D0> 14 

gccacatctg actgrtcaag ccctcgc 



<2io> is 
<2ti> 2na 



27 



<2l2> PNA 

<213> ArvaJxirtopsi* thitiiiina ecotype Ccilursbia 

<223> Clone 13 

<4 00> j.5 

gccacatctg actgttcaag ccctcgcccc aacgtettgc catctcaaac agtttggaei: 60 

tgnaaggatc cettaccaag gggcc^catt Ccgctctttg tcaagtaaca cagagatgac 120 

cccctcsgcc aacactccgc asca S ;t 3 ga ggttgtgaaa aacaattcag atggatcgga 160 

atctggc^cc ttactccata atatgaatca cacacctaca girataeggtt. ccaggcttct 240 

tagacactgg gcgacccratc ctctacgcga tagaaacetg atatctgctc ggcttgatgc 300 

tgtttctgag atetctgcec gcatgggatc tcacagttct tcccagccca gcagtgagtfc 3«o 

ggttgaagaa ggttctgaga gagcaattgt atcacctgag ttttacctcg tgctctCCtc 420 

agtcttgaca gccatgtcta gatcatccga taetcaacgt ggaataacaa gaatccttca 4S0 

Ccggactgct aaagccacag agttcatfcgc agttatggaa gctattttac ttgcggggaa 540 

gcaaattcag cggcctggca taaagcaaga ctctgaaatg aggageaegc aatctgcaac $00 

tgtgcgaccc actcttttga gaaaattgat ttctgtcatt tcatcccctg ttgtggttga 660 

caatgccgga aaacttctct ctgccctaaa taaggaagcg gctgttcgag gegacttgct 720 

cgacatacta atcacttcca gcgaccaact tcetgagctc gccgaagctc gccaagcagt 730 

ttcagtcatc agggaaaagc Cggattcctc gacagcctca tctcgcaaga agcccgctat 840 



WO 9492 PCT^F**^??? 
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tcgaaat t tg 


gaatttctfcc 


.sagtgccggg 


gatcacacat 


ttgatagagc 


tgcccgttga 


90 0 


t cccaaqgtc 


cctatgaact 


gg^cgaaagt 


aastagcacc 


aagaagacta 


tfrcgatatca 




ccccccagaa 


a cage age tg 


gcttggatga 


getageceea 


goaactgaac 


accttgccac 


1020 


tgtgsaccgs 


get tcgtggg 


atagtttcct 


csagagtttc 


agcagatact 


acacagattt 


}0£0 


taaggctgcc 


gttcaagetc 


ttgetgeact 


ggactgtttg 


cactcccttt 


caactctatc 


1140 


tagaaacsag 


aacfcatgfccc 


gtcccgagtt 


cgtggatgac 


tgcgaaccag 


ttgagaeaaa 


12 00 


catacsgtct 


ggtrgtrcatc 


ctgtactgga 


gactatatta 


caagataact 


tcgccccaaa 


1260 


tgacaeaatt 


ctgcacgcag 


aaggggaata 


ttgecaaatt 


atcaccggac 


ctaacatggg 


1320 


aggaaagagc 


tgetatatec 


gtcaagttgc 


ttcaacttcc 


ataatggctc 


aggttggttc 


1350 


ctttgtacea 


gegtcaeteg 


ccaagccgca 


egtgcctgat 


ggtgttttca 


c t egga tggg 


14 40 


cgcttcagac 


agtatccagc 


atggcagaag 


tscctttcta 


gaagaatcaa 


gegaaocate 


150O 


acacataar.c 


3<$aacrtgr.t 


cttct cgttc 


qcrtgs C9ta 


t Cagacgagc 








uacgacg^tg 


vagccaccgc 


etatgeaaca 


t t acagcatc 


tcccagcaga 


162Q 


aaagagatgt 


ttggttcttt 


tcgtcacgco 


t taccctgaa 


a t age t gaga 


tcag taacgq 


1630 


actcccaggt 


cctgttggga 


cataccatgt 


ctcgcacctg 


acattgeaga 


aggataaagg 


1740 


cagtcacgac 


catgatgatg 


cgacctacct 


abataagect 


gtgcgtggtc 


e ttgcageag 


18 00 


gagct ttggt 


t ttaaggttg 


ctcagcttgc 


ccagacacct 


ccatcatgta 


taogecgage 


- ie*o 


catttcaaxg 


getgeaaaar. 


tggaagctga 


ggcacgcgca 


agagagagaa 


atacaegcafc 




gg^agaacca 


gaaggacatg 


aagaaccgag 


aggegcagaa 


gaatctactt 


eggctctagg 


1980 


CgactCgtCt 


gcagacctga 


aatttgetet 


ctctgaagag 


gacccttgga 


aagcattcga 


2040 


gtttttaaag 


catgcttgga 


agattgetgg 


caaaatcaga 


ctaaaaccaa 


ctcgtccatt 


2100 


ttgacccggg 














<210> 
<211> 

<2l2> 


1* 
29 
DMA 













<213> Artificial sequence 

<220> 

<223> MSH3 specific primer 551 fox PCJl uaxng cJWA ot Arabidopsis 

thaliana e cot ype Columbia 

<400> 16 
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3 

99T*tcgggta ctgggttttg agtgtgagg 

<210* 17 
*711> 30 
<212> DNA 

<213> Artificial sequence 

<220> 

t^^fr Ci£i ° Primer 3525 f ° r * C * U9i ^ ot Arabidopai 

t ha liana ecocype Columbia ww * aopai 

35gttctgac tatgtgtgac gcctca^ta 

<2lB> is 

*211> 3522 

<2l2> DNA 
<2VS> 



23 



30 



AraJ>ldop*i* cfivlixna ecotypc Coiufttti* 



<220> 

^22 1> CDS 



<222> (100 ) <}342) 

<400> IS 



cc.aagasag cgegcgaaaa tcggcaaccc aagetcgcca tagccacgao cacga<c tC c 
cattcctcrt aaacggagga gattacgaat aaagcaact 

22 ?T Sf* C&g a ° 9 atC tct Ctc tec get ccc aaa 

Wet Oly Lye Gin Lya Gin Oin Thr lie Ser Arg R. Phe Ala Pro "a 



3 10 



15 



Pro Ly* ser £o *£ mi S* CBg m C ° 9 gt * SCC »- tc * c ~ •« 
V " Ser *?? Hl * Pro Val Ala Glu Ser ser Thr 

20 35 3 0 

Pro" Pro !2 f 35 % Ca CC ° 900 * cc * Ca tcc "* tct ccc tec aag cgt 
Pro Pro Pro Ly* n e Ser Ala Thr val ser Phe ser Pro ser vyl Irl 

35 40 45 

ttl fl* f" tCC gac Ca ° Ctc « cc *= c gcg tea ccc aaa aa, ecu aaa 
Lye Leu t eu ser a*p hi* ^ u Ala AU Ala 5ec Pro Lya ~ J£ 

55 60 

Leu Sar Pro IT S? d aaC CM * ta CCC * at ccc * ac tta 

Leu Sar Pro Hxa Thr Gin Asn Pro Val Pro Aap Pro Asa Leu Hi a 0 £ 



SO 

195 
243 
2*1 
339 
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aga ttt ccrr cag aga ttt ctg gaa ccc ccg ccg gag gaa tat gtt ccc 397 
Arg Phe Leu Gin Arg Phe Leu Clu Pro Ser Fro Glu Glu Tyr Val pro 

35 $0 9S 



$aa 3 eg tea cca teg agg aaa cac aca cca ttg gas cag caa gtg gtg 
Glu Thr Ser Ser Sar Arg Lys Tyr Thr Pro Leu Glu Gin Gin 'Ml Val 
10U 105 110 



435 



gag eta aag age aag tac cca gat gtg gtt ttg atg gtg gaa gtt ggt 4S3 
Glu Leu Lys Ser Lys Tyr Pro Asp Val Val Leu Met Val Glu val Gly 
US 120 125 

tac agg tac aga ttc ttc gga gaa gac gcg gag ate gca gca cgc gtg 53 i 

Tyr Arg Tyr Arg Phe E>he Gly Glu Asp Ala Glu Tie Ala Ala Arg Val 
130 135 14 0 

ttg ggt att tac get cat atg gat cac aat etc atg acg gcg agt gtg 57 9 

Leu Gly lie Tyr Ala Kt3 Met Asp His Asn Phe wet Thr Ala Ser val 

1« ISO 155 ISO 

cca aca ttt cga ttg aac ttc c&z gcg aga aga ctg gtg aat gca gga $2 7« 

Pro Thr Phu Arg Lou Asn Pk© His Val Arg Arg Lev Val Asn Ala Gly 

16* 17$ 

tac aag ate ggt gta gtg aag cay «ct gaa act gca gec att aag tec &*?S 
Tyr Lys He Gly Val Val Lys Gin Thr Glu Thr Ala Ala lie Lys Ser 
180 IBS 190 

cac ggt gca aac egg acc ggc ccx ttt ttc egg gga ctg teg gcg ttg 723 
Bis Gly Ala Asn Arg Thr Cly Pro Phe Phe Arg Gly Lfeu Ser Ala Leu 
195 2D D 20S 

tat acc aaa gec acg ctt gaa gcg get gag gat 31a agt ggt ggt tgt 
Tyr Thr Lys Ala Thr Leu Glu Ala Ala Glu Asp T.l* s*r Gly Gly Cys 
210 21S 220 

ggt ggt gaa gaa ggt ttt ggt tea cag age aat ttc ttg gtt tgt gtt 819 
Gly Gly Glu Glu Gly The Gly Snr Gin Ser Aen Phe Leu Val Cya Val 
225 230 235 240 

gtg gat gag aga gtt aag teg gag aca tea ggc tgt ggt abb gaa atg 867 
Val Aap Glu Arg val by* Ser Glu Thr L*u Gly Cys Gly jl* Glu Met 
245 250 255 

agt ttt gat gtt aga gtcr ggt gtt gtt ggc ?tt gaa att teg aca ggt 915 
Sar Phe Aep Val Arg Val Gly val Val Gly Val Glu He Ser Thr Gly 
260 265 2T0 

gaa gtt gtt tat gaa gag fctc aat gat aat ttc atg aga agt gga tta 963 
Glu Val Val Tyr Glu Glu Phe Ann Aap Asn Pha Met Arg S*r Gly Leu 
275 280 2ft5 



MSOOCID: <WO 9919492AZTI > 
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1 0 

SS III 111 Til lit HI HI r « »« « «■ «* «t c« g 10 11 

*** 5*£ Ser PrD Ala ^ Leu Gly <31n 

300 



cc~ cct tea caa ca<t acr. ctsq a^c* h „„ „ 

Pro Leu Ser Gin Gin Thr II^ t ~ , 9 * * Cat SCC g * a cct 

305 GlU LyB **• Leu Vsl * la M «* Ala Gly p ro 

- - - - 5 » - s c s 2 5 5 s: c - 

330 335 

s s e e.s K s s = a 5 s 5: a = s 

345 350 

?: z 2 s c 5 - s - 3 2 s s a s 

360 365 

3S* atg cct tgc teg .ca gT.c cac aca act arq * Aa cat et „ 

My Mec Ser eye Leu Thr V.l Hi, Tbr Tic Met Asn S SI 2 

3 7S 



acc 3tt gec etc gec ctA acq ttt tur<r 

rhr va. G >„ Al3 Ueu M ^ ^ ^ ^ ^ ^ t« gga 

330 400 

III n! I" ^ C C " " C 5 " 9&c tea trt^ege r.cc ttg tea acrt 

Phe Glu Ar S lie Leu Tyr Gin Cly Al* Scr Pbc Arg Ser L«u s^x Ser 

405 415 

^hr IS 22 I." CtC 9CC a " C aCt c «* c *° «9 «g gag « t 

JUn Thr «u >,ec Thr Leu Ser Ala As» Thr Leu Gin Gin Lo„ Glu Val 



420 42S 



430 



E? II I « Ca 93t 933 tCS gaa tcc ggc tce "* "c car aac 
Ly* as„ Aan Ser Asp Gly ser Glu Ser Gly ser Le« rbe Hia IS 

5 4 *° 445 

a £ a s s s s 5: 5 «% c 5 s = a s s 

455 460 



53 III S Ct Cta t9 ° 9 " a9a aat tta Aca tcc Set egg cct gac 
v*J llw Hi, Pro Leu eye Aep Arg Aan Leu tie Ser Ala A% Leu Hp 

470 "5 4 BO 

si: S £ SI £ S ST I cc cat asc tct tcc c ** 

xie ser Ala Cy& Met Gly Ser Hie Ser 3er Ser Oin 
*&5 4 9Q 



1P59 



1107 



li55 



X202 



1443 



153 7 



; OOCID: <WO 9919492A2TI > 



WOW/19491 



1 1 

etc age age gag teg get gaa $aa ggs tct gag aga gca att gta tea 16 3 5 

r-r-u ser Ser Giu Leu v*i gi u Glu Gly Ser Glu Arg Ala He Val Ser 

SOD SOS 51Q 

cct gag ttt tat ccc gtg etc tec tea gtc ttg aca get atg tct aga 1633 
Pro Glu Phe Tyr Leu Val Leu Ser Ser Val Leu Thr Aid Wet ser Arg 
S15 520 525 

tea tct gat att caa cgt gga ata aca aga ate ttt cat egg act get 1731 
Ser Sex Asp lie Gin Arg Gly Ila Thr Arg lie Phe Hia Arg Thr Ala 
530 535 540 

aaa gec aca gag etc att gca gtt atg gaa gec act tta c'tt gcg ggg 177 9 

Lys Ala Thr Glu *he lie Ala Vai Wet Glu Ala lift Leu Lea Ala Gly 
545 550 553 S6O 

aag eaa act cag egg ctt ggc aca aag ca* gac tct gaa atg agg agt 1927 
Lys Gin He Gin Arg Leu Gly lie Lys Girt Aap Ser Glu Met Arg Ser 
565 576 575 

atg caa tec gca ace gtg cga tct act ctt ttg aga aaa ttg att tct 1B?5 
Met Cln Ser Ala Thr val Arg Scr Thr Leu Leu Arg Lys Leu lie Ser 
Sao 585 59f> 

c$tt atr. tea tec cct gr.t gcg gvv. gac sat gec gga aaa ctt etc tct iS2i 
Val lie Scr Scr Pro Val Val Val Aap A&n Ala GLy Lya Leu Leu Ser 
S9S 600 £05 

gec eta aat aag gaa geg get gtt eg* ggt gac ttg etc gac ata eta i$7l 
Ala Lou ASii Lys Clu Ala Ala Val Axg Oly Asp Leu Leu Aep He Leu 
610 615 *2D 

stc act ccc age gac caa ttc cot gag ctt grt gaa get cgc caa gca 201$ 
lid Thr Ser £er Asp Gin Phe Pro Glu Leu Ala Glu Ala Arg Gin Ala 
625 6ZO 635 640 

gtt tta gtc ate agg gaa aag ctg gac tec teg ata get tea ttt cgc 2067 
Val Leu val He Arg Glu Lye Leu Asp Ser Ser lie Ala Ser Phe Arg 
645 650 £55 

aag aag etc get att cga aat ttg gaa ttt ctt caa gtg teg ggg ate 2115 
Lys Lys Leu Ala He Arg Aan Leu Glu Phe Leu Gin Val Ssr Gly lie 
660 €€B <7Q 

aca cat ttg aca gag ctg ccc gtt gat tec aag gtc cct atg aat tgg 2 16 3 
Thr Hia Lau Ha Glu Lbu Pro Val Aap Ser Lys Val Pro Kifi Aatl Trp 
S75 680 685 

gtg aaa gta aat age aec aag aag act act cga tat cat ccc cca gaa 22 li 

Val Lys Val Asn Ser Thr Lya Lya Thr lie Arg Tyr His Pro Pro Glu 
690 700 



NSDOCIO. <WO 9919492AZTI_> 



12 SS K ST, 2 g n ~ III i E 2 K T. X K - 

710 720 

act 3C9 *ac ega get teg tgg qac *a+ 

»u « « Arg « S J » 25 2; £ ~ - s «• s ™ 

730 735 

- - = 5 s s-s s: « a e 2 s s 2: s 



tgt teg cac tec etc tea acc eta tct aga aac aag aac tat ot r 

LEU "'I -* ^ 3 " *« - ser A? g ^ L ** £ £ «2 2g 



76S 



830 



six "e S2 S 2° *** * CC Cta * aa *** * ta *St W gcg 
-jr He Cln « 4 b G1 y ^ Ser T „ r phe ^ ^ cl<j ^ ^ j£* 

870 87 5 0dD 

52 »? C « ° ^ * CC C9C tCC tCt « tc * ctt gtt ata tta gat 

B« H,a lie tie Arg Thx Cy B Ser ^ Ser ^ £J J£ Asp 



2355 



ccc gag ctt gtg gat gac tgt ga* cca gtt qao afcA 

»» ot. fhe v.i « P *«p cy, P „ sj sn e e e e 2451 
5 *'b ass e 22 s e s: k s e 2: 2: e e - 

IJ * 795 eoo 

aat gac aca acc ttq nac qca q^a 

a. p TOr 31 . ^ Bi. e e e £ 3: e s e s 2541 

S HI E 5 5 K 2 K 5 £ £ Arg Sta Si S 2 



s 2: s a s a s m 1- ss ^ = a = 2: k - j 

935 «« 845 

aag ctg cac gtg ctt gat ggt gtt ttc act egg atg ggc get tea a**. 
vy 9 jj» Hl3 val Aap oly val Ph. Thr A ?g „*? «I 52 2p 



2787 



BBS B90 >t p * 

- - - s ^ ^ c ?s ^ s oT y m s: 5 - - 

? " 910 
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gca aca tta cag cat etc eta gca gaa aag aga cgt ttg gtt etc tct 28 B 3 

Ala Thr Lou Gia His Lou Leu Ala Glu x,ys Arg CyS Leu Val Leu Phe 
SIS 920 925 

gtc acg cat tac cct gaa sta get gag ate agt aac gga ttc cca gge 2S>31 
Val Thr Hia Tyr Pro Glu Lie Ala Glu lie 3er Asn Gly Pbe Pro Gly 
*30 935 940 

tct gtt ggg aca tae cat gtc teg tat ctg aca ttg cag aag gat aaa 2 97 9 

Ser val Gly Thr Tyr Kia Val Sex Tyr Leu Thr Leu <3ln Lya Asp Lya 
S45 350 955 960 

ggc age- tat gat cat gat gat gtg ace tae eta tat aag ctt gtg cgt 3057 
Gly £er Tyr Asp His Asp Asp Val Thr Tyr Leu Tyr Ly& Leu Val Arg 

970 $75 

ggt Ctt tgc age agg age ttt ggt ttt aag gtt get cag ctt gee cag 3075 
Oly Leu Cys Ser Arg ser Phe Gly Phe Lys Val Ala Gin Leu Ala Gin 
930 995 990 

aca cct cca tea cgt ata egt cga gee ait tea acq get gca aaa ttg 3l2> 
lie Pro Pre Ser cys He Arg Arg Ala Tie Ser Met Ala Ala Ly & uu 
995 100D 1005 

gaa get gag gta cqt gca aga gag aga aa: aca cgc atg gga gaa cca 3171 
Glu Ala Glu Val Arg Ala Arg CJlu Arg AST) Thv Arg Met tsly Glu Pro 
1D10 1015 1020 

gas gga cat gaa ^aa ccg aga ggc gca gaa gaa tct att teg get eta 3 219 

Glu Gly Ki£ Glu Glu Pro Arg Gly Ala Gl« Slu He Ser Ala Leu 

1025 1030 1035 1040 

ggt gac ttg ttt gca gac ctg aaa ttt gcr. etc r.cr qaa gag gac cct 3267 
Gly Asp Leu Phe Ala Asp Leu Lys Phe Ala Leu Ser Glu Glu Asp Pro 
1045 10S0 1055 

tgg aaa gca ttc gag ttt tta aag cat get tgg aag att get ggc aaa 3315 
Trp Lye Ala Pbe Glu Phe Leu Lya i*ie Ala Trp Lys He Ala Gly Lya 
1060 1065 . . 1070 

ate aga eta aaa cca act tgt tea, ttt tgacttaatc tcaacattat 3 362 

lis Arg Leu Lye Pro Thr Cys Sar Phe 
1075 10*0 

ageaactgea aggtcttgat catctgttag ttgegtacta acttatgtgt attagt-ataa 3422 

caagaaaaga gaattagaga gatggattct aatccggtgt tgcagtacat cttttcccca 34 a 2 

cccgcataaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3522 



<2l0> 19 
<211> 10B1 
<312> PRT 
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<213, Ar^i<*op*is tAaiian* ecotype Columbia 

f .4 0O> 19 



Met Gly Lys GLn Lye Gin Gin Thr lie Ser Arg Phe ?he A la Pro Ly3 

Pro Lys S „ Pro Thr Hia Glu Pro Aen Pro Val Ala Glu Ser Ser Thr 

Pro Pro Pro Ly S He Ser Ala Ihr val ser Phe Ser Pro Ser Lys A rg 
3S 40 45 * 

Ly S Leu Leu Ser A 5 p Hi., Leu Ala Ala Ala Ser Pro Ly S Lye Pro Lye 

S 0 

Leu Ser Pro Kia Thr Gin Asn Pro val Pro Asp Pro Asn Leu » is Gln 

70 75 80 

Arg Phe Leu Cln Arg Phe Leu Glu Pro ser Prr> Glu clu 



85 



90 



Glu Thr ser Ser Ser Ar 3 uys. Tyr Thr Pro Leu clu Gin 



100 IDS 



Tyx Val Pro 

95 

Gin Val Vol 
HO 



Glu Leu us,, ser l,y 8 Tyr Pro Asp V«l Val Leu Met Val Glu Val Gly 
115 "0 125 Y 

Tyr Arg Tyr Arg Phe Phe Gly Ola A*p Ala Glu ri« Ala Ala Arg Val 

Leu Cly lie Tyr Ala His Wee A** His AO* Phe Me. Thr Ala ser Val 

LS ° 155 „ 0 

Pro Thr Phe Arg Leu Asn Phe Hi* val Arg Arg Leu Val A W Ala Gly 

170 375 

Tyr Lys lie Gly Val Val Ly 5 Gin Thr Glu Thr Ala Ala He vys ser 
190 la * 19o 

Hie Gly Ala Asn Arg Thr Gly Pro Phe Phe Arg Gly Leu Ser Ala Leu 
195 2 °0 205 

Tyr Thr Lys Ala Thr Leu Glu Ala Ala Glu Asp He Ser Gly Gly eye 

215 220 
My Gly Glu Glu Gly Phe Gly Ser Gin Ser Asn Phe Leu Val Cye Val 



23C 233 



240 



Val Aep Glu Arg Val Ly,. ser Glu Thr Leu Gly Cye Gly He Glu Met 
245 250 2S5 
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Ser Phe Asp Val Arg Val Gly Val Val Gly Val Glu lie Ser Thr Gly 
260 265 270 

OLu val Val Tyr ulu clu Phe Axn Asp Asa Phe wet p*rg Ser Gly Leu 
275 230 285 

Clu Ala Val lie Leu S©r Leu Ser Pro Ala Glu L*u Leu Leu Gly Gin 

23V 29S 3O0 

Pro Leu Ser Gin Gin Thr Glu Ly& Phe Leu val Ala Mat Ala Gly Pro 
305 310 315 320 

Thr Ser Aaa Val Arg val Glu Arg Ala Ser Leu Aep Cya Phe Ser Aan 
225 330 335 

Gly Asu Ala val Asp Glu Val lie Scr Leu Cys Glu Lye lie Ser Ala 
340 345 3S0 

Gly Asn Lftu Glu A*p Asp Lya Glu Met LyS Leu Glu Ala Ala Glu LyB 
355 360 3*5 

Cly Mftt iSer Cy* Le^ Thr vfal Kis Trxr lie Met Asn Mot- Pr-n His Leu 
310 375 50O 

Thr Val Gin Aid r..eu Ala Leu 7!ir Phe Cys His Leu Lya Gin Phe Gly 
3BS 330 395 300 

Phe Glu Arg lie Leu Tyr Oln Gly Ala &er Phe Arg Ser Leu Ser Ser 
405 410 415 

Asn Thr Glu Met Thr Leu ser Ala Asn Thr Leu «ln Gin Leu Glu Val 
420 425 430 

Vni Lys A»n Asn Sot Asp Gly Ser Glu 6er Gly Ser Leu Phe Hie Asn 
435 440 445 

Met Asn His Thr Leu Ttir Val Tyr Gly Ser Arg Leu Leu Arg K3.9 trp 
450 455 460 

Val Thr His Pro Leu Cya Asp Arg Asa Leu lie Ser Ala Arg Leu Asp 
465 470 475 4fi0 

Ala Val Ser Glu lie Ser Ala Cys Mot Gly Ser Hia Ser 8©r Ser Gin 
4*5 490 495 

Leu Ser Ser Glu Leu Val Glu Glu Gly Ser Glu Arg Ala £le Val ser 
500 505 510 

Pro Glu phe Tyr Leu val Leu Ser Ser val Leu Thr Ala Met Ser Arg 
515 520 525 

Ser Ser Aep lie Glu Arg Gly He Thr Arg lie Pha His Arg Thr Ala 

530 > 535 540 
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ljj Ala Thr Glu P„e nj Ala v*i „et «„'«, rie Leu ^ Aia 

b5 ° 555 560 

Ly.3 Gin He Gin Ax 5 l* u Gly Sle Cya ^ Ajjp ^ ^ ^ 

* 6S 570 57 g 

Ket Gl ° til Tbr V31 Arg * r IS L6U ^ U LVS slo "* Ser 

VM He sex ser Pro val Val v al Asp Aan A3a Qly Lys ^ ^ 
>,s 600 6o5 

Ala Leu Asn Lys Glu Ala, Ala val Arg <n y Asp Leu Leu A6p Ila Leu 

lie Tbr 5 er Ser A*p Gin Phe Pro Glu Leu Ala Glu Ala Arg Ola Ala 

630 



S4P 

val Leu ^ 2l c A r 9 Giu Lys Leu Asp Ser Ser lie Ala Ser Phe Arq 
* 4S "0 6S5 

Lys Ly, K* u Ala I], Arg A, n L«u Clu Phe Leu Cl„ Val Ser Gly lie 
660 670 

Thx His Leu lie Glu Leo Pro v a3 A&p ser Lya Val Pro Hi* Asn TTp 
* ■* s&o sas 

Val Lys Val Asn Ser Thr Lys Lys rhr lie Arg ryr Hie Pro Pro Glu 

Ilj Val Ala Gly Leu Asp Glu Leu Ala Leu Ala Thr Glu Hie Leu Ala 

710 72o 

He val Asn Arg Ala ser Trp Asp Ser P„e Leu Lys Ser Pb* Ser Are 
725 7S0 73S 

Tyr Tyr Tbr Asp Phe Ly* Ala Aia Val Gin Ala Leu Ala Ala Leu Asp 

7 *° 7 *5 750 

cys Leu Hie SBr teu Ser ^ teu SBr Arg ^ Ly ^ ^ 
59 760 765 

Pro Glu Phe v al Asp A*p Cy* Glu Pro Val Glu lie Asn lie Gin Ser 

Gly Arg His Pro Val Leu Glu Thr He Leu Glo Asp Asn Phe val Pro 

7 * D 795 aoo 

Asn Asp Thr He Leu Hie Ala Glu Gly Glu Tyr cy« Gin lie He Thr 

* os aio sis 

Gly Pro Asn Met Gly Gly Ly S 6e * C ya Tyr lie Arg Gin Val Ala Leu 
fl2l> 82S „„• 



SCOCIO: <WO 9919492A2TI > 



WO 



17 

I Lb Ser lie Mot Ala Gin V*l Gly Ser Phe V*l Pro Ala Ser Fhe Ala 
835 840 945 

Lys lieu Hi a Val Leu Asp Gly Val Thr Arg fr3et Gly ALa ser Aap 

950 955 860 

Ser lie Gin His Gly Axg Ser Thr Phe Leu Glu Glu Leu Ser Glu Ala 
ass 875 a&o 

Ser Hl9 lie lie Arg Thr Cys S*r Ser Arg Ser Leu Val lie Leu Asp 
&6S 890 995 

Glu Leu Gly Arg Gly Tbr Ser Thr Hia Asp Gly Val Ala 2le Ala Tyr 
900 9t >S 510 

Ala Thr Leu Gin Hi a Leu Leu Ala Glu LyS Arg Cys Leu Val Leu Phe 
915 920 925 

val Thr Bis Tyr Pro Glu He Ala Glu lie Ser Aen Gly Phe Pro Gly 

935 940 

Ser Val Gly Thr fyr His Val Scr Tyr Leu Thr Leu Gin Lya Asp Lyc 
945 9S0 955 560 

Gly Ser Tyr Asp Bis Aap Asp Vrtl Thr Tyr Leu Tyr 1/ya Leu Va) Arg 
965 $75 

Gly Leu Cys Ser Arg Ser Phe Gly Phe Lys Val Ala Gin Leu Ala Gin 
$30 9B5 

lie Pro Pro Ser eye lie Arg Arg Ala Tie Ser Met Ala Ala Lye Leu 
595 1000 1DOS 

Glu Ala Glu val Arg Ala Arg Glu Arg Asn rhx Arg Mot Gly Glu Pro 
iOlO 1015 1020 

Glu Gly His Glu Gla Pro Arg Gly Ala Glu Glu Ser lie Ser Ala Leu 
1025 1030 103S 1040 

Gly Asp Lea Ph* Ala Aep Leu Lye phe Ala Leu ser Glu Glu Asp Pro 
1045 1050 10S5 

Trp Lye Ala Phe Glu Phe Leu Lys His Aia Trp Lys lie Ala Gly Lys 
IPSO 106S 1070 

lie Arg Leu Lya pro Thr Cy3 Ser Phe 
1075 10BO 



<21Q> 20 

<211> 24 

<212> DNA 

<213» Artificial sequence 



slSOOCID: <WO 9919492A2TI > 



<220> 
<22l> 



1 a 

MSB* specific primer 63 8 for PCR vising cDNA of AraMdopsis 
thaliana acotypa Columbia 



<400> 20 
Ctftctaecag gcgacgaaaa accg 

<210> 21 
<211> 29 
<212> DMA 

Artificial sequence 



24 



<220> 
<223> 



Priiner 531 for PCR using cDHA of Arabidopsi* thaliana ecotvoe 
Columbia 2 * 



<4D0> 21 

cgtcgccttt agcatcccct tccttcac * 6 

<210> 22 
<2ll> 30 

<2L3> Artificial sequence 



<220> 
<223:> 



specific primer S323 for PCS vising cdna of ArabidopBis 
Cha liana ecotype Columbia 



<*O0> 22 

gcttggcgca cctaatagaa tcatgacsgg 3Q 

<210> 2} 

<2X1> 24 

<212> DNA 

<2i*> Artificial sequence 



<220> 
<223> 



M5KS specific primer 637 for PCR using cdna of Arabidopsis 
thaliana ecotypa Columbia 



<400> 23 

gacagcgtca gttcttc*g* atgc 24 

<21D> 24 
<211> 35 
<212> DMA 



WO 99/19492 
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19 • - 

^ 21 3 > Artificial sequence 

<230> 

-223> specific primer l£S tor PCR usicig cDNA of AraMdcpais 

Chilian* ecocype coluimbia 

atccogggat gcagcgrcag agatcgai;tt. tgt 33 

Ol0> 25 

<2U> 2? 

<212> DNA 

<213> Artificial sequence 
<220* 

<223> MSH$ specific prinve-.r SP3 for PCR using cDNA of Arabidopeia 
thalicma ecotype Columbia 





25 








w 


cgctatctat 


ggctgct teg 


aattga^: 






27 


<211> 
<212> 

<223> 


2S 

138S 
DMA 

Arab i dope is thai i an a e co type Columbia 


















cccgggacgc 


agegecagag 


atcgattttg 


tctttcttcc aaaaacccac 


ggeggegacr. 


SO 


acgaag^gtt 


tggtttccgg 


egatgetget 


ageggegggg geggcagegg 


aggaccacga 


120 


tctaatgtga 


aggaagggga 


tgetaaagge 


gacgctcctg tacgttttgc 


tgtttcgaaa 


iao 


tetgrcgatg 


aggttagagg 


aacggatact 


ccaccggaga aggetccgeg 


tcgtgtectg 


240 


ccgcetggat 


ttaagcegge 


tgaatccgcc ggegatgett cgfcccctgtt 


ctccaatatc 


300 


atgcataagt 


ttgtaaaagc 


cgatgaccga gactgttctg gagagagga^ 


ccgagaagac 


350 


gttgtcecgc 


tgaatgattc 


atctctatgt 


atgaaggeca atgatgttan 


cccccaactt 


420 


cgtcccaata 


atggtaaaac 


tcaagaaaga 


aaccatgett ctagcttcag 


tgggagagct 


490 


gaacctagat 


cagtagaaga 


tataggagta 


gatggcgatg ctcctggccc 


agaaacacca 


540 


gggatgcgtc 


CRcgtgcttc 


tegcttgsag 


•cjagfctctgg Aggacgaaat 


gacttctaag 


GOD 


gaggataagg 


tccccgtatt 


ggaccctaae 


ar>aaggctga aaatgetcea 


ggatccggtt 


i£60 
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cgr-g^agaga 
atcagggatg 
ataccacctg 
agtgaae^ta 
ctagatgcgg 
aaatgcagac 
gcccgtggat 
agaggtgcea 
agcgagggaa 
gagctacaaa 
ttttgggt tg 
C3ggcLtctc 
gcccc«agga 
gtaatggggg 
aaaggcccrv 
cttag^gctc 
aagcatgggg 

acgacggtaa 
Cacaaatatc 
tgceatccac 
acggcaaact 
gaaagactgc 
ettctgggga 
ttcagaagtrg 
ctctataaae 
tctcaattcg 



^gaaagaagt 
ccaatagang 
atgtcttcaa 
tggacantgt 
aattaggtca 
aggttggtat 
ataaagttgg 
atactica at 
acatcgggcc 
agtgttcaac 
sgtctutcag 
caaaggaagt 
aatatacgcc 
atacagatgc 
cxgaatcatg 
ttggaga<?ct 
atatctdttc 
atcttgagat 
ttgataactg 
tcaaagacgc 
cagaaagtat 
tcggacgcat 
aaaaagtgcr 
gaattgatcc 
tctgtaaacc 
aagcagccat 



aaacgaegga 
acgtcctgat 
gaaaatgtce 
gcctttcctt 
caaggagctt 
ctccgaaagt 
acgaaccgag 
cccaaggaag 
T„gatgccgt.c 
tgtgtatgga 
rrgar.gar.^!:.^ 
gttatac-^ac 
gacagggncc 
tgctggagtt 
gaactgcgct 
aattaatcat 
ataccasgst 
atttaacaac 
tgttagtcca 
agaaagcatc 
gcaaAtcact 
caagtctagc 
gaaacaacga 
gttgccggct 
tcetacatta 
agatagcg 



accaaaxttg 
gatccccctt 
gcaccacaaa 
aaagtgggga 
gactggaaga 
gggatagatg 
cagctagaaa 
ctagttcagg 
catcctcttg 
tttgcttttg 
tcratgtgocg 
agcaaagggc 
acggcggcac 
agaaatataa 
gttgatggtc 
ccgtctaggc 
tacaggggtt 
agccgtgacg 
actggraage 
aatsaacggc 
ggccagtatc 
gttcgatcat 
gteaaagcat 
ctacagaagg 
gtaggaaaaa 



aatggcccga gtcttctrcga 720 

acgatagaaa gaccttacae 750 

agcaaeatfcg gagcgtfcaag 940 

aattttatga gctgtatgag 9 do 

tgac«tgag tggtgtggga 950 

aggcagtgca aaagctatta 1020 

catctgacca agcaaaagcc 1030 

tatcaactco atcaacagca 1140 

ctacaaaaga gatcaaaatg 1200 

ttgactgtgc tgccttgagg 12*0 

ccctcggagc gttattgracg i320 

tatcaagaga agcacaaaag 2 330 

agfcfcggctcc 3gcaccacaa 1440 

tagaatctaa cggatacttt 1500 

taaatgaatg tgatgttgcc L5-50 

taaagctaga agatgtactt 1S20 

gcctcagaat cgatggccag 16B0 

gtggtcctcc agggaccttg 1740 

gaccctcaag gaattggatc 1BD0 

ttgatgtagc tgaagaattc IB 6 0 

tccacaaact tccagactta 1920 

cagcctctgt gttgcctgct 19*0 

ttgggcaaat tgcgaaaggg 2040 

aaeeaaacat gatgagtttg 21.00 

gcgggctaga gttatttctc 3i60 
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2\ 



-:210> 
2 1 2. > 

<2l3* 



27 

L3Q5 

DMA 



ArabicJopsie thaliana ecotype Columbia 
Clong 52 



<400> 



catca^cctc tgtgttgcct gctcttctgg ggaaa&aagt gctgaaacda cgagttaaag 
CatCtgggca aattgtgaaa gggttcagaa gtggaattga tctgttgttg gctctacaga 
aggaatcaaa tatgatgagt ttgctetata aactctgtaa acttcecata tcageaggaa 
aaagcgggcC agagtcatet ctttctcaat tcgzxagcagc cacagatagc gactttccsa 
attatcagaa ccaagatgtg acagatgaaa acgctgaaac tctcacaata cttstcgaac 
tttttatcga aagagcaacc caatggtctg aggtcatcca caecataagc cgectagatg 
tcctgagatc ctcegcaare gcagcaagtc tcectgctgg aagcatggcc aggeccgtr.^ 
tttttrccga stcagaagct acagatcaga ac.cag*a&ac aaaagggcca atacttaaaa 
tccsaggact acggcatcca tttgcagccg ca^ccgatgg tcaattgcct gttccgaatg 
autactcct tggcgaggct agaagaagca gtggcagc3t t: eatccccgg tcacfcgttac 
tgacgggacc aaacacgggc ggaaaatcaa ctcctettcg tgcaacatgt ctggccgtta 
tctttgccca acttggctgc tacgtgccgt gtgagtcttg cgaaatctcc cccgtggata 
etaccctcac aaggcttggc gcatctgata gaatcacgac aqgagagagt ^ccttttrgg 
tagaatgcac tgagacagcg tcagttcttc ag&atgcoac tcaggattca ctagtaatcc 
ccgacgaact gggcagagga ac tag tact t tcgatggata cgccattgca tactcggttt 
tccgtcaccb ggtagagaaa gttcaabgtc ggatgc*ctt cgeaacacat taccacectfc 
tcaccaagga attcgcgtct cacccacgtg tcaccccgea acacatggct tgcgcattca 
aatcaagatc tgattatcaa ccacgtggtt gtgatcaaga cctagtgttc ttgtaccgtt 
taaccgaggg agcctgtcct gagagctacg gacttcaagc ggcactcatg gctggaatac 
caaaccaagt ggttgaaaca gcatcaggtg ctgctcasgc catgaagaga tcaattgggg 
aaaacttcaa gtcaagtgag ctaagatctg agttetcaag tctgcatgaa gactggccca 
agtcattggt gggtatttct cgagtcgccc acaacaatgc ccccattggc gaagatgact 
acgacacttt gttttgctta tggcatgaga tcaaatcccc ttactgtgtt cccaaataac 



SO 
120 
130 
240 
300 
360 
4lo 
480 
540 
500 
£50 
120 
7B0 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1390 



WO 99/3 94 92 



ccggg 



<210> 2B 



22 



<2\2> DffA 

<213* Artificial sequence 



<220> 
<223> 



KSK6 specific primer 288 for pcr using cDNA of ArabldoMi* 

thaliana ^^vad rw-ii._w.2-. w ^* i,s 



t ha liana ecotype Columbia 
<400> 2€ 

atcccgggtt atttgggraac *cagtaagag gatt 



<2i0> 2$ 

<211> 27 

<212> DNA 

<213> Artificial sequence 



3* 



<22C> 
c223> 



»SHG specific primer S82 for pca uaing cDWA of Arabidopaia 
tbalian* ecotyro Columbia 



:otype Columbia 

gcgttcgatc atcagceect gcgttgs 27 

<211> 3606 

*212> DWA 

<213> Arabittopnls thallana ecotyp* Columbia 
<22fl> 

<221> CDS 

<225> U42> 04€91 

<223> AtWSH* full-length cona and dertucad sequence or the encoded 
polypeptide 

<400> 30 

aaasgttgag ccetgaggag tatcgtttcc gccatttcta cgacgcaagg cgaaaatttt so 
tggcgccaat ccttccccee tttcgaatcc ectcagetea aaacatcgtt tcectctcac 120 

tctctctcac aattccaaaa a atg cag cgc cag aga teg att ttg cct ttc 171 

Met Gin Arg Gin Arg ser He uu Ser Phe 
1 5 l6 



-»nrrrir> <wo 
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ttc caa aaa ccc ace gcg gcg act acg aag ggt ttg gtC tec ggc gat 213 
Ph* Gin Lye fro Thr Ala Ala Thr Thr Lye Gly Leu Val Ser Gly Asp 

15 20 25 

get get age ggc ggg ggc ggc age gga gga cca cga tct aat gtg aag 257 
Ala Ala 3*r Cly Gly Gly Gly ser Gly Giy Pro Arg Phe Asn Val Arg 
30 35 40 

gaa ggg gat get aaa ggc gac get tct gta cgt ttt get gtt teg aaa 315 
Glu Gly Aap Ala Lys Gly Asp Ala Ser Val Arg Phe Ala Val Ser Lya 
45 SO 55 

tct gtc gat gag gtt aga gga acg gat act cca ccg gag aag gtt ccg 35* 
Ser Val Asp Glu Val Arg Gly Thr Asp Thr Pro Pro Glu Lya Val Pro 
60 6S 70 

cgt cgt gtc ctg ccg tct gga ttt aag <;cg get gaa tec gec gsfc gat 411 
Arg Arg Val Lou Pro Sex Gly Phe Lys Pro Ala Glu Ser Ala Gly Aap 

75 ao es 90 

get teg tec etg tec tec aac ate atg eat aag ttt gta aaa gtc gat 459^ 
Ala Sfer ser Leu Phc S*r Aj;r. ; w^r. itis Ly& Phe Val Lyc val Asp 

10O 105 

gac ega gat tgt icz gga gag agg age cga gaa gat gtt gtt ccg ctg 50"? 
Asp Arg Aep Cys Ser Gly Glu Arg Bar Arg Glu Asp Val Val Pro Leu 
HO 115 120 

aat gat tea tct cca cgt atg aag get aat gat gcr att cct caa ttt 5SS 
Asn Aep Ser Ser Leu Cys Met Lys Ala Aaa Asp Val He Pro Gin Phe 
125 130 L35 

cgt tec aat aat ggt aaa act caa gaa aga aac cat get ttt agt ttc 603 
Arg Ssr Asa Aaa Gly Lys Thr Gin Glu Arg Aen His Ala Phe Ser Phe 
140 14S 150 

939 ^ga get gaa ctt aga tea gta gaa gat ata gga gta gat ggc 6 SI 

Ser Gly Arg Ala Glu Leu Arg Ser Val Glu Asp 11% Gly Val Aap Gly 
155 160 1*5 170 

gat gtt cct ggt cea gaa aca cca ggg atg cgt cca cgt get tct cgc 699 
Aep Val Pro Gly Pro Glu Thr Pro Gly Met Arg Pro Arg Ala Ser Arg 
175 1B0 1S5 

ttg aag cga gtt ctg gag gat gaa atg act ttt aag gag gac aag gtt 7*7 
Leu Lya Arg val Leu Glu Asp Glu Wet Thr Phe Lye Glu Aap Lys val 
190 1S>5 200 

cct gta teg gac tct aac aaa agg ctg aaa atg etc cag gat ccg gtt 7 95 

Pro Val Leu Aep Ser Asrt Lys Arg Leu Lya Met Leu <31n Aap Pro Val 
205 210 215 
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t^t gga gag «ag aaa gaa gta *nc <?aa g§a acc a3a ttt tgg ctt 943 

eye Gly Glu Ly* Lye Glu vaL Asi> Glu Gly Thr Lye Phe Glu Trp L<=u 
220 225 230 



gag tct cc: cga acc agg gat; gcc aat aga aga cgt cct gat gat ccc 

Glu ser Arg lie Arg Asp Ma Asn Arg Arg Ar^ Pro /Up Asp Pro 

240 245 2S0 

etc tac gac aga aag acc tea cac ata cca cct gat get ttc aag aaa 

Leu Tyr Asp Arg Lys Thr Leu His lie Pro Pro Aap Val Phe Lye Lya 
255 260 265 



391 



£39 



atg tct gca tea caa aag caa tat tgg age gtt aag age gaa cat atg 987 
Wet Ser Ala £er Gin Lya Gin Tyr Trp Ser Val Lys Ser Glu tyr Mat 

270 275 280 

gac att gig ctt ttc ttt aaa gT.g ggg aaa ttt tat gag ctg tat gag 1035 
abp 11* val Leu Phe frhe Lys Val Gly Lys Phe Tyr Glu Leu Tyr Glu 

20O 295 

» 

eta gat gC9 gaa tta ggt cac auy gag csc gac tgg aag atg acc atg 100 3 

J-eu Asp Ala Clxt Leu Sly Kis Lys Glu Leu Asp Trp Lya Met Thr Wet 
303 ICS 310 

agt ggt gvg gga aaa tgc ag« rag gt!; ggt ate tct gaa agt ggg ata H3i 
Sex Gly Val Gly Lys Cys Arg Gin Val Gly He 5er Glu Ser Gly He 
315 320 325 330 

gat gag gee grg caa aag eta tta gcc cgt gga tat aaa gtt gga cga 1179 
Agp Glu Alii Val Gin LyB Leu Leu Ala Arg Gly Tyr Lys Val Gly Arg 
3 35 34G 34 5 

ate gag ca? eta gaa aca tct gac caa gca aaa gcc aga ggt get aat 122? 
He Glu Gin Leu Glu Thr Ser Asp Gin Ala Lys Ala Arg Gly Aid Asn 
350 355 350 

act ata att cca agg aag eta gtt cag gta tta act cca tea aca gca 1275 
Thr lie He Pro Arg Lya Lau Val Gin Val Leu Thr Pro Sar Thr Ala 
3«5 370 375 

age gag gga aac ate ggg cct gat gcc gtc cat ctt ctt get ata aaa 13 23 

Sar Glu Gly abh rie Gly Pro Asp Ala Val Hia Leu Leu Ala He Lya 
38 0 385 350 

gag ate aaa atg gag eta caa aag tgt tea act gtg tat gga ttt get 1371 
Olu He Lya Met Glu Lau Gla Lys Cys Ser Thr Val Tyr Gly Phe Ala 
395 400 405 410 

ttt gtt gac tgc get gcc ttg agg ttt tgg gtt ggg tec ate age gac 1419 
Phe Val Asp Cya Ala Ala Leu Arg phe Trp Val Gly Ser He Ser Aap 
415 420 425 
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gc3 tea cgt gcc gen ctt gga gcg tta ttg atg cag gtt tct cca 1467 

Asp Ala ser Cys Ala Ala Leu Gly Aid i/SU Leu Met Gin Val Sar Pro 
*3D 435 440 

aag gaa gtg ct3 Lac gac agt aaa ggg eta tea aga gaa gca caa aag 151S 
Lye Glu Val Leu Tyr Asp Ser Lye Gly Leu Ser Arg Glu Ala Gin Lys 
445 450 4S5 

get eta agg aaa tat a eg ctg aca ggg tct acg gcg gta cag ttg get 156 3 
Ala Leu Arg Lya Tyr Thr Leu Thr Gly Bar Ttir Ala Val Gin LaU Ala 
460 4*5 470 

cca gta cca cea gca atg ggg gat aca gat get get gga gtt aga aat 1611 
pro val Pro Gin Val Met Gly Asp Thr Asp Ala Ala Gly Val Arg A*n 
475 4SO 485 490 

ata ata gaa tcb aac gga eac ttt aaa ggt tct tct gaa tea egg aac 165 9 

lift Tift Glu £er Asn Gly Tyr Phc Lys Gly Ser Ser Glu Ser Trp Asn 
-i$S 500 50S 

tgt gee gtt gat ggt eta aat gaa cgt gat get gee etc agt get ctt 170*7 
Cys Aid Val ftsp Gly Leu Asr. Glu Cys? Asp Val Ala Leu Ser Ala Leu 
f.10 Si 5 520 

gga gag eta act aat; cat etg tct agg eta aag eta gaa gat gta ctt 1755 
Gly Glu Leu lie Abji His Leu Ser Ax?g Leu Ly* Leu Glu Asp Val Leu 
525 53 0 53 S 

aag cat ggg gat att ttt cca tac caa gtt tac agg ggt tgt etc aga 1303 
Lys His Gly Aap lie Phe Pro Tyr Girt Val Tyr Arg Gly Cys Leu Arg 
540 545 550 

att gat ggc cag acg atg gta aac ccx gag ata ttt aac aat age cgt iB5i 
lie Asp Gly Gin Thr Mat Val Asn Leu Glu lie Phe Abji Asn Ser Cys 
555 5*0 565 570 

gat ggt ggc cct tea ggg acc ttg tac aaa tat ctt gat aac tgt gtt IB 9 9 

Asp Gly Gly Pro 3* r Gly Thr Lbu Tyr Lya Tyr Leu A&p Asn Cys Val 
575 580 535 

age cca acc ggc aag cga etc tta agg aat tgg ate tgc cat cca etc 1947 
Ser Pro Thr Gly Ly* Arg LaU Leu Arg Aan Trp lie Cys Hi a Pro Leu 
590 595 600 

aaa gat gta gaa age ate aat aaa egg ctt gat gca gtt gaa gaa ttc 1995 
Lya Asp val Glu ser 11* Asn Lys Arg Leu Asp Val Val Glu Glu Pha 
605 610 615 

acg gca aac tea gaa agt atg caa ate act ggc cag tat etc cac aaa 204 3 
Thr Ala Asn Ser Glu 6 r wet Gin He Thr Gly Gin Tyr Leu His Lys 
£20 625 630 
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ctt cca jac tea gaa aga ctg etc gga cgc ate sag tot age gtt "qa 

Leu Pro Aep U U clu Arg Leu Leu Gly Arg tie Lve Ser Ser Val Ar* 

* 35 640 «4S ' £S Z 

Cca tea gee tct gtg Ctg cot get ctt ctg ggg aaa aaa gcg ctg aaa 
Ser ser Ala ser Val Leu Pro Ala Leu Leu Gly Lys Lys val Leu Ws 
" 5 660 465 

caa cga grc »a gca ttt ggg caa att gtg aaa ggg ttc aga age qoa 
Gin Arg val Ly* Aja Phe Gly Gin lie val Lys Gly Phe Arg sir Gly 
670 675 680 

att gat ctg teg tCg gee eta cag aag gaa tea aat aeg aeg agt etg 2^2S 
lie Asp Leu L*u Leu Ala Leu cm Lys Glu eer Asn Mac Met ser Leu 

66s 690 635 

ctt tat aaa etc e g t aaa cct cct aca tea gca gga aaa age ggg cea 
Leu Tyx Lya Leu Cys Lys Leu Pro lie Leu Val Gly Lys Ser Gly Leu 
700 705 71ri 



2091 



2139 



2137 



2283 



2331 



2379 



242? 



247S 



gag eta ett ctt tct caa ttc gaa gca gee ata gac age gac ttc cca 
nlu Lau ?he leu s ftr Gin t>he Glu Ala Ala lie Asp ser Asp «>he Pro 
715 720 m " 730 

aat cat cag aae caa gac geg aca gat gaa aae get gaa act etc aca 
Aen Tyr Rln M» Gin Asp Val rhx Asp Glu Aan Ala Glu Thx Leu Thr 

740 745 

aca cct ate gaa ctt ttt ace gaa aga gca ace caa egg tee aag gee 
lie Leu lie Glu Leu Phe lie Glu Arg Ala Thr Gin Trp Ser Glu Val 
750 755 7CO 

act eae ace ata age tge eta gat gtc ctg aga tct ttt gca ate gca 
lie Has Thr lie Ser Cys Leu Asp val Leu Arg ser Phe Ala He Ala 
765 770 775 

gca age etc tct get gga age aeg gec agg cct get ate tct ccc gaa 2523 
Ala S«r Leu S Br Ala Gly Ser Mac Ala Arg Pro Val lie Phe Pro Glu 
780 76S 790 

tea gaa get aca gat cag aae cag aaa aca aaa ggg cca ata etc aaa 35 7i 
Ser Glu Ala Thr Asp Gin Asa Gin Lys Thx Lys Gly Pro lie Leu Lys 
795 S00 8 05 

*?f ri« ?? a f Ca t5g C f t CCa tCt gca 9tC * ca * cc sat ggc caa ttg 
lie Gin Gly Leu Trp His Pro Phe Ala Val Ala Ala Asp Gly Gin Leu 

815 820 B25 

cct gtt ceg aat gat aca ccc ett ggc gag get aga aga age age ggc 2<SS7 
Pro Val Pro Asa Asp He Leu Leu Gly Glu Ala Arg Arg Ser Ser Gly 
930 835 840 



WO 99/ J 9491 PCT/teP9Jtfl>6977 

age att cat cct egg tea ctg tea ctg acg gga cca aac atg ggc gga 2715 

Ser He His Pro Arg Ser Leu L*u Leu Thr Gly Pro Aan Met Gly Qiy 
345 SSD &5S 

aaa tea act cct etc car. gca aca tgt ctg gee gtt ate ttc gec caa 276$ 

Ly* ser rhr Leu Leu Arg Ala Thr Cys L*u Ala Val lie Phe Ala Gin 
860 $65 &70 

ctt ggc tgc tac gtg ccg tgt gag tct tgc gaa ate ccc etc gtg gat 2311 

Leu Gly Cys Tyr val Pro Cys Glu Ser Cy* Glu lie Ser Leo Val Asp 

97* 380 B35 330 

act ate tec aca agg ctt ggc gca tec gat aga ate atg aca gga gag 2S5 9 

Thr He Phe Thr Arg Leu Gly Ala Ser Asp Arg I la Mat Thr Gly Glu 
895 900 90S 

age acc Ctt ttg gta gaa tgc act gag aca gcg tea gtt ctt cag ait 2907 

Ser Thr Phe Leu Val Glu Cy« Thr Glu Thr Ala Ser Val Leu Gin Asn 
910 915 920 

gca act cag gat tea <:ta gta ar.c ^tc gac gaa ccg ggc aga ggai act 2955* 

Ma Thr Gin Asp Ser Leu va; He Leu Asp Glu i-ev Oly Arg Gly Thr 
D2S £30 935 

agt act ttc gat gga cac gec act gca tac teg get ttt cgt cac ccg 3003 

Ser Thr Phe Asp Gly Tyr Ala lie Ala Tyr Ser Val Phe Arg His Leu 
940 945 950 

gta gag aaa get eaa tgt egg atg crtc ttt gca aca cat tac cac ccc 3 0S1 

val Glu Lya Val Gin CyS Arg Met Leu Ph<2 Ala Thv Hia Tyx riis Pro 

955 960 965 970 

etc acc aag gaa tte gcg tct cac cca cgs. gtc acc teg aaa cac atg 309£ 

Leu Thr Lye Glu *>he Ala Ser His Pro Arg Val Thr Ser Lye Hie Met 
976 SBO 935 

get tgc gca etc aaa tea aga tct gat tat caa cca cgt ggt tgt gat 3147 

Ala Cys Ala P*» Lys Ser Arg Ser Asp Tyr Gin Pro Arg Gly Cys Asp 
*90 995 1000 

caa gac eta gtg ttc ttg tac cgt eta ace gag gga get tgt cct gag 3195 

Gin A*p Leu Val Pbe Leu Tyr Arg Leu Thr Glu Gly Ala Cys Pro Glu 
1005 1010 10X5 

age tac gga ctt caa gtg gca etc atg get gga ata cca aac caa gtg 3245 
Ser Tyr Gly Leu Gin Val Ala Leu Met Ala Gly He Pro Asti Oln Val 
102D 1025 1030 

gtt gaa aca gca tea ggt gec get caa gec atg aag aga tea ate ggg 52 92. 

val Glu Thr Ala Ser Gly Ala Ala Gin Ala Wet Lya Arg Ser lie Gly 

1035 104D 104S 1050 



Monnrrirv ,wn ooio-»ew»-rri ^ 



WO 99/19*91 

FCTyRP9S/06977 

T C £1" f** 3gt 3 ** C " a aga tCt ttc tea agt ctg oat 333, 

Olu te„ Ph. Lys Ser Ser «* A rg ser Glu Phe Sex Ser Leu £a ? 

J0SS "SO 10 65 

gaa gac tgg etc aag cca teg gtg ggt att tct cga gtc gec cac aac 
OIU Aap frp i™ Lya Ser Leu Val Gly II* 3or A ? g ?.l JJJ hH J£ 

1070 10 *5 aoeo 

aac gec ccc ate ggc gaa gat gac tac gac act ttg ttt tqc fcfca tea 

"MS U6 ABP ASP W A9 * Th * «*2 ^ cya Leu S 

lDaS 1090 1095 *^ 

cat gag ate aaa ccc tct tac tgt gee ccc aaa taaatggcta > 47a 
Kls Glu He Lys Ser Ser Tyr Cya Val Pro Lya 4 3 

1100 ll05 

tgacataaca etaCctgaag ctcgttaagt cttetgcctc tctgatgttt attcctcct* 353a 
aaaaaegctt atatatcaaa wauoi v.v r.e 9 «»iM aaaaaaaaaa aaaaaaaaaa »M 
aaaaaaact 



<210> v. 

<2ll> H09 

<2i2j» prt 

*2i3> Aj-a^itfop^i* eyj*i i*™* «cotype Columbia 

<223> Polypeptide MSH6 

<400> 3i 

Met Gin Arg Gin Arg Ser He Leu Ser Phe Phe Gin Lya Pro Thr Ala 

Ala Thr Thr Lys Gly t*u Val Ser Gly A B p Ala Ala Ser Gly Gly Gly 

20 - 25 30 

Gly ser Gly Gly pro Arg Phe Aan Val Arg Glu Gly Aap Ala Lya Gly 
35 40 45 

Asp Ala ser Val Arg Phe Ala val Ser Lya ser v*l A3p CJu Val 

50 S5 



60 



Gly Thr Aap Thr Pro Pro Glu Lys Val Pro Arg Axg Val Leu Pro Ser 

5 5 •*'••■ — 



70 75 



SO 



Gly Phe Lys Pro Ala Glu Ser Ala Gly Aap Ala Ser Ser Leu Phe Ser 

*S 90 95 

*an He Met Hia Lya Phe Val Lya Val Asp Asp Arg A p Cys Ser Gly 
100 1©S no 

Glu Arg Ser Arg Glu Asp Val val Pro Leu Aan Aap ser ser Leu Cya 
115 130 12S 



3387 



3435 



3605 



crw-irv ^uur\ 



WO 99/1*491 



29 



Met Uys Ala Asn Asp vaL He Pro Glu Phe Arg Ser Asn Aan Gly Lys 
130 135 140 

Thr Gin Glu Arg Ami His Ala Phe Ser Phe Ser Gly Arxj Ala Glu Leu 
145 ISO 1SS ISD 

A*g Ser vai Glu Asp lie Gly Val Asp Gly A*p val Ero Gly pro Glu 
1SS 17D 175 

Thr Pro Gly Met Arg Pro Arg Ala S«r Arg Leu Lys Arg Val Leu Glu 
180 185 190 

Aep Glu Met Thr Phe Lys Glu Asp LyS Val Pro Val LeM Asp Ser Asn 
1$5 204) 205 

Lys Arg Lau Lys Met Leu Gin Asp Pro Val Cys Gly Glu Ly» LyB GVu 
210 215 220 

Val Asn Glu Gly Thr Lys Phe Giu Trp Leu Clu Scr Sex Axg lie Arg 
225 230 23S 240 

Asp A la A/iH Arg Arg Arg Pro Asp Aop Pro Leu Tyr Asp Arg Ly& Thr 
245 250 2S5 

Lou His He Pro Pro Asp Val Phe Ly» Lys Met ser Ala Scr Gin Lye 
2SO 265 270 

Glp Tyr Trp Ser VaI Lye Ser Glu Tyr Met Asp lie Val Leu Pbe Phe 
7.75 7BQ 285 

Lys Val Gly Lys Phe Tyr Glu Leu Tyr Glu Leu Asp Ala Glu Leu Gly 
29D 295 300 

His Lys Glu Leu Asp Trp Lys Met Thr Met Sex Gly Val Gly Lya Cys 
305 310 215 320 

Arg Gin VAl Gly rid Ser Glu Ser Gly lie Aap Glu Ala Val Gin Lys 
*2S 330 335 

Leu Leu Ala Arg Gly Tyr Lys> val Gly Arg Tie Glu Gin Leu Glu Thr 
340 345 350 

Ser Asp Gill Ala Ly» Ala Arg Gly Ala Aati Thr lie He Pro Arg Lys 
3 55 350 365 

Leu Val Gin Val L*u Thr Pro ser Thr Ala $er Glu Gly Aim lie Gly 
370 375 3B0 

Pro Aep Alfr Val His Leu Leu Ala lie Lys Glu lie Lya Met Glu Leu 
38S 390 395 400 

Gin Lya Cya Ser Thr Val Tyr Gly Phe Ala Phe Val Asp Cys Ala Ala 
405 41D 415 



WO 99/19492 



Leu Axg Phe Trp val cly ser lie Ser ABp ^ Ala Sur Cys ^ ^ 

425 

L*u Gly Ala Leu Leu Mc Gin Val S * r p ro ^ ye Glu Val Leu Tyr 

440 4 4S 



Met 
80 



»r Lys Gly Leu Ser Arg Glu Ala Gln Lys ^ ^ ^ Lyg ^ ^ 

Leu Thr Gly Ser Thr Ala Val 01. Le U Ala Pro Val p rD ein Vei M 

Gly Asp T nr ABp A ^ Ala Qly Val ftrg A£n g ^ ^ 

465 450 49S 

Tyr Phe Ly B Gly Ser Ser Glu Ser rrp Asn Cya Ala Val Aep Gly Le u 

505 51 q 

Aan Glu Cys * 8p v al A l« Leu Ser Ala Leu Cly Glu Leu He A*» B l. 

515 520 525 

Leu Ser Arg Leu Ly* Le U C * u Asp Val ^ Lys ^ Gly Afip ^ ^ 

535 S40 

PVO Tyr Cl„ Val ryx Arg Gly cyo Leu Arg ,1* Asp Cly Gin Thr Met 

550 555 5(5D 

Val Aen Leu Glu lie Phe Aan Asn Ser Cys Asp Gly G].y Pro Ser Gly 

S70 ' 575 

Thr Leu Tyr Lys Tyr Leu Asp Asn cya Val Ser vro Thr cly Ly 0 A rg 
S6 ° 590 

Leu Leu Arg A*n Trp lie Cya Hie Pro Leu Lya Asp Val Glu Ser Ii 
595 **>0 60S 



e 



615 S20 



Aax* Lya Axg Leu Asp Val Val Glu clu Phe Thr Al* Asn ser Glu Ser 
Met Gin iu Thr Gly em Tyr Leu Hie Lya Leu Pro Asp Leu Glu Arg 

630 53S 64(> 



Leu Leu Gly Arg n e Lys Ser 6er Val 



Arg Ser Ser Ala Ser Val Leu 

6S5 



Pro Ala Leu Leu Gly Lya Lya Val Leu Lya Gin Arg Val Lya Ala Phe 



Gly Gin lie Val Lya Qly Pne , A rg Ser Gly 1 1 Aap Leu Leu L*u Ala 
675 680 5a5 

Leu Gin Lys Glu ser Aan Met wee ser Leu Leu Tyr Lya Leu Cya Lys 



700 



;nnr.r> , W r> oco-oo.-m - 



WO W19492 



PCT/fcP98/O6077 



3f 



Leu Pro lie Lea Val Gly Lys Ser 
705 7*o 

Phe Glu Ala Ala lie Asp Ser Asp 
725 

Val Thx Asp Glu Aan Ala Glu Thr 
740 



Gly leu Glu Leu t>he Leu Ser Gin 
71& 720 

Phe Pto Asn Tyr Gin Asn Gin Asp 
75o 735 

Leu Thr lie Leu lie Glu Leu Phe 
745 750 



II* Glu Arg Ala Thx Gin Trp Sex Glu Val tie Kia Thr lie Ser Cys 
75S 76 0 76 5 

Lau Asp Vul Leu Arg Ser Phe Ala lie Ala Ala Ser Leu Ser Ala Gly 
7?0 775 780 



Ser Met Ala Arg 
785 

Asn Gin Lys Thr 



Pro val lie Phe 

7 90 

Lys Gly Pro 
305 



Pro Glu Ser GLu 
79S 

Leu Lyc He Gin 
610 



Ala Thr Asp Gin 
soo 

Gly Leu rxp wis 
915 



Pro Pi to Ala val 
B20 

Leu Leu Gly Glu 
935 



Ala Ala Asp Sly 

Ala Arg Arg Ser 
640 



Gin lau Pro Val 
Ser Gly Ser lie 



Pro Aao Asp lie 
830 

His Pro Arg Ser 

84S 



Leu Leu Leu Thr Gly Pro Asn Met Gly Cly LyS S«r Thr Leu Leu Arg 
$50 B5S &60 

Ala Thr Cy* Leu Ala Val He Phe Ala Gin Leu Gly Cys Tyr val Pro 

870 675 

Cys Glu Ser Cys Olu Tie Ser Leu Val Asp Thr Zle Phe Thx Arg Leu 
BBS 830 395 

Gly Ala Ser Asp Arg lie Met Thr Gly Glu Ser Thr Phe Lsu Val Glu 
$00 $05 $10 

Cys Thr Glu thr Ala Ser Val Leu Gin Asn Ala Thr Gin Asp Ser Leu 
315 920 925 

Val He Leu Asp Glu Leu Gly Arg Gly Thr Ser Thr Phe Asp Gly Tyr 
$30 935 940 

Ala lie Ala Tyr Ser Val Phe Arg His Lou Val Glu Lya Val Gin Cys 
945 950 355 960 



Arg Met Leu Phe Ala Thr Mia Tyr His Pro Leu Thr Lya Glu Phe Ala 

965 970 9-75 

Ser His Pro Arg Val Thr Ser LyS His Ket Ala Cys Ala Phe Lys Ser 
$80 985 990 



32 



PCT/EF9S.'W977 



Arg Ser Asp Tyr Gin Pro Aro* Gly Cys Asp Gin Asp Leu V*l Phe Leu 
9*5 1000 LOOS 

Tyr Arg Leu Thr Glu Gly Ala Cys Pro Gl-j ser Tyr Gly Leu Glo Val 
1010 lOtS 1020 

Ma L*u Met Ala Gly He Pro Aan Gin Val Val Glu rhr Ala Ser Gly 
1025 1030 *D35 1040 

Ala Ala Gin Ala M*t Lys Arg Ser He Oly Glu Asa Phe Lye Ser Ser 

1D50 1055 

Glu Leu Arg Ssr Glu Phe Ser Sox Leu His Glu Aap Trp Leu Lya S©r 
10*0 106S 1070 

L*u val Gly lie Ser Arg val Ala His Asn A*a Ala Pre lie Gly Glu 

1075 1 OB D 1065 

Asp Asp ryr Asp Thr i**u vne Cys U-a Trp His Glu He Lys* £er Scr * 
1090 109S 1100 

Tyr cya Val Pro i.ya 
1105 



<210> 
<21l> 

<212> 
^2l3> 


32 
24 
OKA 

Artificial sequence 


<220> 
<223> 


Forward primer for pcfc 
Ttticrooatellite 


<400> 


32 


accatgesta 


gcrtaaactt cttg 


<210> 
<211> 
<212> 
<213> 


33 
22 
DSA 

Artificial sequence 


*220> 
<223> 


Reverse primer for PGR 
mi croaatellite 


<400> 




acataaccac 


aaataggggt $C 



24 



22 



WO 99/1*492 



PCT7EPWD6$n7 



33 

<210.> 34 
<2ll> ia 
<2l2> DMA 

'-21 3 > Artificial sequence 

<220> 

<22 3> Porvard primer DMCIN-A for PCR on genomic DMA of Arabidopsis 

thai i ana esp, Landaberg eracta R Ler° 

<400> 34 

gaagc^atat cqttcgtg 13 
c2io> 

<211> 13 

<212> DMA 

<213> Artificial sequence 

<220> 

<m> jievereu primer DMCIN-B for PCR on genomic dna of Arabidopa^s 
thaliana ssp. Lancisb^rg erecta H Ler" 

<4D0> 35 

sgarcgcgag aacaitcc 13 

*2iD> .1$ 

<211> 31 

<212> DHA 

<2iS> Artificial sequence 
<220> 

<223> Forward primer Z>MCIff-l tor PCR on genomic dna of Arabidopeis 
ChaXiana Bsp, Landaberg erecta "Ler H 

c*00> 36 

acgcgtcgac tcagctatga gattactcgt g 31 

<210> 31 

<2il> 2^ 

<2l2> DNA 

<213> Artiticial sequence 

<220> 

*233> Reverse primer DMCIM-2 for PGR on genomic DNA of Arabidopeia 

tftaliana asp. Landsberg ererta "ler" 

<400> 37 

gctctagatc tcccgctcta agactctct 29 



JSDOCIO. <WO 9919492AZTI > 



WO 99/1 9492 



34 



^21Q> 33 

<211> 32 

<212> DMA 

<2L3> Artificial sequence 



<220> 
<223* 



Forward primer DMCIWO for PGR on 9 *nomic toa of Arabidopeia 
tbalxana osp. Landeberg erecta »Ler- *^i«^ia 



c40ft> 34 

gctcta$agc ttctcttaag taagtqattg at 

<2i0> 39 

<2U> 48 

<2l2r- DMA 

* 2 "> Artificial sequence 



32 



r„iA*iar.a sap. -»andabe-rg erecta "Ler" 



<40l)> 33 

cceecc*99« t*?A9*9**e tceicggttt cttcagctci atgaatcc 48 

<2l0> ^0 
*2llr» 26 

<213> Artificial sequence 



<22D> 

<223> 



Forward primer I>MCla for PCR on gonomic ONA of Arabidopei* 
tneiiana asp. Landaberg erecta M Ler* 



acgcgtcgac gaattcgcaa gtgggg 



2$ 



<211> 38 
c2l2> DNA 

<213> Artificial sequence 



<220> 



Reverse primer DNClb for pol on genomic DNA of Arabidopaij 



thai lane eap, Landsberg erecta m xmr* 

<4Q0r 41 



"XTOtf> <Wn OQ1Q4Q9A-7TI -v 



WO 9M W92 PC17EFIW9697T 

35 



rceatggaga 


tct cccgggt ajscrga tct^e ttcgaggg 


3fi 


<212> 
<2I2> 
--213> 


42 
20 
DNA 

Artificial se quence 




i22Q > 


Forward priTner for PCR amplif i cation of ATbati 
Ar-abidops is thaliana eubapeciee 


SSLP marker in 


<40G> 


42 




gccaccgcgt 


gsatgatatg 


20 


<210> 

<2ii> 

•:2*12> 
<213> 


43 
22 
DWA 

Arti f icial sequence 




*, a v •> 

<223> 


Reverse primer for PCR anipl i f icacicm of ATEAT1 
Arab i dope is thaliana subspecies 


SSLP marker in 




43 




cgaacagcca 


scattaattc cc 


22 


<.210> 
<211> 

<213> 


1& 
DMA 

Artificial sequence 




<220> 
<223> 


Forward primer for PCR amplification of SSLP marker in 
Arabidopaia thaiiaaa subspecies 




44 






cagaagcg 


10 


<21Q> 
<211> 
<212> 
<213> 


4S 
15 
DNA 

Artificial sequence 





*220> 

<223> Reverae primer for FCR araplif icat: ion of WGAS3 sslp marker in 

Arabidopaia thaliana aubfipaeies 



tNSDOCID: <WO 9919492AZTI_> 



J 

WOW19492 (']' t 

acccaagtga tcgccacc 



<310> 4$ 

<211> 22, 

<2l3> DKA 

<213> Artificial sequence 



<220> 

<223> 



taccgaacca asacacaaag g 

<2l2> &NA 

<213^ Artificial sequence 



21 



<230> 
<223> 



<400> 47 

Cctgtatctc ggtgaattct cc 

<211> 22 
<212> DNA 

«2l3> Artificial aequence 



22 



<220> 
<223> 



<400> 4Q 
ggtctgttga cgtcgtaagfc eg 

<210> 49 

<211> 2 2 

<212> DWA 

<*13* Artificial sequence 
<220> 



22 



^SOOCID: <WO 9919492A2TL> 



WO 99/1 9492 PCTYEPiWO^ *?7 

37 

<2 2>> Reverse primer for PCR ampi Lf ication of* NGA12 8 SSLP marker in 

Arabidopsia thaLiana subspecies 

^ DO ? 49 

stcttgaaac ctcta-gggag gg 2 -? 



<:2l0> 50 

<21l> 22 

<212> DMA 

<213> Artificial sequence 
<220> 

<223> Forward primer for pcr acapii f i cation of NCA2SO sslp marker in 
ArabictopeiB thai i ana subspecies 



22 



<400> 


SO 


etgatctcac 


ggacaauagc gt 




51 


<211> 


20 


<212* 


DATA 


<213> 


Ar ti f icial sequence 


<22D> 




<223> 


Reverse primer far FCR amplif Icavian 




Ar-abidopeis thaiianA subspecies 


<400> 


Si 



ggctccatsa aaa9cgcB.cc 20 



<210> 


52 


*2ll> 


21 


<212> 


UNA 


<213> 


Artificial sequence 


<220> 




<222> 


Forward primer tor PCR amplification 




Arabidep£i& ti*a liana euJbapecies 


<400> 


52 


ctccagttgg aagctaaagg g 

V 


c2IQ> 


S3 


<211> 


21 


<212> 


OKA 


<2l3> 


Artificial sequence 



in 



21 



JNSDOCID: <WO 9919492A2TI_> 



W0 99V194JK 



38 



<,220> 

<40b> 53 
tgttttttag gacaaatggc g 



21 



2 1 D > 54 
<;211> 20 



<2l2> p W A 

*213> Artificial sequence 



c22Q> 
*223> 



JSSSJr^TS fr PCR ^P 1 " 1 *""* «>* 9 sslp marker in 

Arabadopais thaliana subspecies 



<400> 54 
ccttcacacc caaaacccac 

<210> ss 

c2ll> ao 

<212> DNA 

c213> Artificial sequence 



<220> 



Ara^idopaiB t ha liana Gubepecies 



«c40D> 55 
gcacataccc acaaccagaa 

<21D> 55 

<212> 20 

<212> DNA 

<213>. Artificial aequence 



20 



<220> 

<223> 



r^T r f.S rlTn " £ ° r PCR aro P li£ Action of Majuue SSLP marker 
m Arabadopaia thaiiana subspacies 



<400> 5$ 

cgccacgctrt ttcggtaaag 



20 



^SOOCID: <WO 9919492A2TL> 



WO 99/19494 

<210> S7 
<211> 20 
<212> OWA 

<:Uf Artificial sequence 

*22 3> R«v«rsa primer for PCR amplification of NGAH26 marker 

in ArabidooBis chaliana subspecies 

<4O0> 57 

$c&cagtcca agtcacaacc 20 

*210> sa 
<212> DNA 

<213> Artificial sequence 

<220> » 
<22l> Forward primer for PCR amplification of KGA3 6 1 SSLP marker in 

Arabvriopa i s tJ;.aiinna subspecies 

<4O0> 58 

aaagagatgs gaatttggac 20 

<210> S$ 

<2H> ^3 

.:J>12> DNA 

<213> Artificial sequence 
<220> 

<223> ReverBe primer £or PCR Ampl if i cation of NGA3G1 SSLP marker in 
Arab! dop* iff thai i ana subspecies 

<400> 59 

acatatcaat atattssagt age 23 
Ol0> $0 

<2ii> ia 

<212> DMA 

<2L3> Artificial sequence 

<22Q> 

<223> Forward primer for &CR amplification of HGA16S SSLP marScer in 

Arabidopsi* tnaliana sub*g>©ci a 

<400> 60 



5NSDOCID: <WO 9919492AZTI > 



WO 99/19492 - , 

PCT/EP98/0W77 



40 

<212> DNA 

<2l3sh Artificial sequence 



13 



<220> 
<223> 



Arabidopsi* cha liana subspecies 



<4Q0> 6i 

gaggacatgt ataggagcct eg 

<210> 52 
<211> 20 
<2\2> DNA 

<2X3> Artificial «u»quft»c* 



22 



O20> 



Por^ard primer cor pqr aopiiCicacioa of Athnro^ ssi^ porker 
in Arabitfopsxs r.halian* subspecies 



<400> 62 

tgacctcccc ttccatggag 

<210> 5) 
<21i> 22 
<212> DNA 

<213> Artificial sequence, 



20 



<22Q> 
<223> 



?rr S J-S ri ^ r f° R aw P lifS ««OH of AChBI02 SSLP marker 

in Arabidopeis thaliana BubspecieB 



<40O> 63 
ctascagaaa cccaaagctt cc 

<210> 64 

«2ii> 21 

<212> DMA 

<2X3> Artificial sequence 



22 



<220> 
*2S3> 



ill Arabadopsis thaliana Bubspecies 



^SOOCID: <WO 9919492A2TI > 



WO 99/1949* 



41 





S4 




aqgcaaacgc 


ccdtttcatc g 


21 


<211> 
<21JJ> 


65 
20 
DKA 

Artificial. sequei>ce 




<220> 
<223* 


Reverse primer for PCS* amplification of 
i n Arabi daps i 3 tha 1 i ana subspec i « s 


AT hit 1"R T P/TITT cot r> m o >- 


<400> 


€5 




acgacatggc 


agatttCtCC 


20 


<210> 
<2ll> 
<212> 


21 
DWA 


• 



^213^ Artificial . r ;ftquoiLC» 

*22D> 

-:223> Forward primer tor PCR amplif icar i.Oj* Of NGA172 SSLP marker in 

Arabidopsis thai i ana subspecies 

<40D> 

agctgettcc tcacagcgtc c 21 

<51D> 67 

<211> 1.9 

<212v DNA 

<2l*> Axtifici3l eequetice 
<220> 

<223> Reverse pr invar for Pcr amplification of ucxxi? SSLP marker in 
Arabi dopaia C ha liana sxibBpeciea 

<4Q0> 67 

catccgaatg ceattgfctc 19 

<2l0> 6S 

<211> 2L 

<212> DMA 

<213> Artificial sequence 
<220> 



1NSDOCID: <WO 9919492A2TI > 



WOW/1 9 492 

PCT7BP91M>6977 

42 

Arabviiopsis chaliana subepecies TnarXer xn 

gaaaaaacgc tacttCcytg g 



21 



<211> 22 
c2L2> i> KA 

<213> Artificial sequence 



c220> 
<222> 



Reveree primer for PCR aniplif ication of SSLP ^ 

AraMdopsia tb*u m Species ***** ln 



<4Q0* g$ 

caagagcaat iitcaagagcu gc 

< 2 1 0 > 7 3 

<211> 20 
<2l2> DNA 

^ 2L3> Arcificial /sequence 



t2 



<220> 
<222> 



Po^ard primer for PC* amplif icanion of NGA162 5 ©LP marker in 
Arabidopsie thaliana e»b*pe*ie* WW in 



catgcsattt gcatcugagg 



20 



2lO> 7i 
<211> 22 



<215> DNA 

«21J> Artificial sequence 



<220> 
<223> 



J^T PTim !! 5?* PCR «P""^tioa o£ KGAI62 SSLP marker in 
ArabidopBis thalxana subapeciee 



ccctgtcact ctcttcctct gg 



<21Q> 72 

<211> 21 

<212> OWfr 

<2i2> Artificial sequence 



22 
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v-220> 

<223* Forward primer for PCR amplif tcatica of SSLP marker in 

Arabidopsia thaliana aubsp&cies 

<400> 72 

tggatttcrt cctctcttca c- 21 



<211> 21 
<212> DMA 

<213> Artificial sequence 

<223> Reverse primer for PCR amplification of ITCA6 SSLP marker ia 

Arabidopeis thaliana subspecies 

<400> 73 

m 

atggagaagc t.r.acactgat c 21 

*2l0> ?* 
<211> 20 
*212 > DNA 

<213> Artificial sequence 

--223> Forward primer for PCR amplification of WGA12 SSLP marker in 

Arabidopeis tha liana subspecies* 

<40Q> 74 

aatgttgtcc tcccctcctc 2D 



<r2l0> 75 
<211> 22 
<212> DNA 

<2I3> Artificial sequence 

<223> Reverse prircex for PCR amplification of NGA12 SSLP marker in 

Arabidopeis thaliana aubapecies 

<:4&0> 75 

t gat get etc cgaaacaaga gc 32 
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*210> 76 
<2ll> 21 
i2l2> OKA 

Artificial sequence 

<223> 



gagggc^aac CttrtattCcg g 

<2lG> 77 

<2il> 22 

<212> DWA 

<2i3> Artificial sequence 

<220> 
<225> 



21 



<40D> 77 
tggctttcgt: ttacaaacat cc 

<2I0> 73 

c^l-U 21 

<212> DMA 

<2l3> Artificial sequence 

<22Q> 
<223> 



22 



<400> 79 
gegaaaaaac aaaaaa*tcc a 

<210> 79 

<211> 21 

<r213* DNA 

<213> Artificial sequence 

■q220> 
<223> 



21 



<400> 73 
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egae^aaccg acagaattag g 



21 



<2L1> 
<223> 



<4D0> 



90 
21 
DMA 

Artificial aeguenco 



Forward primer fox PCR amplification NGA22S SSlp marker in 
Arsbidopsifi thaliana subspecies 



gaaatccaaa ccccagagag g 



21 



<2l0> 
<211> 
<212> 
<213> 

<220> 



<*00> 



BI 

Artiririai sequence 



Reverse primer for PCn ampli f iear ion of NGA22S S£LP marker in 
ArsbddopBis r.ha liana subspecies 

*1 



tctccccact agttttgtgt rc 



<210> 
<211> 
*2l2> 
<213> 

«?20> 
<223> 



<400> 



82 
ONA 

Artifi cial Bequeneo 



Forward primer for PCR amplification of NGA24 9 ssi.P roarfcer in 
Arabidopsis thaliana subspecies 

82 



tacegtcaafc ttcatcgcc 



19 



<21Q> 
<211> 
<:212> 
<213> 

<220> 
<223> 



B3 
22 
DNA 

Artificial sequence 



fteveraa prima r for PCX amplification of NGA24 9 SSLP marker in 
Arabidopsid trial iana subspecies 



3NSDOCID: <WO 9919492A2TI > 



WO Wl 

-400 > $3 
ggatccc^a ctgtasaatc cc 

-.212* DNA 

<21 *> Artificial sequence 

<220> 

<2a3> Forward prinusr for pgr 

Arabidopsis thaXiana s^pecfer^ 011 °* ^ SSLP « 
<4O0* e4 

eatcccagta accaaacaca ca 

*210=» 95 

<211» 20 

<2l2> DMA * 

< 2 ^> Artificial .^quonc-e 

<220* 

<223* Reverse primer for PCR 

Mb***,*. Chalilxi^ub^ecfir" 00 ° f SW * 

<40O» 95 

cccagtctaa ccacgaccac 

<2l0> 8fi 

Oil* 20 

<212> DMA 

< 2i3 * Artificial sequence 
<220> 

* Porwerd primer for pre 



20 



<400> 95 
gtctegg^aa gttttgeegg 

<210> B1 
<2H> 24 



20 



<212> 



DNA 



<215> Artificial aequence 

<220* 
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-.223 > 


RttVrt IT £ q ov-imaf for C^r^]? irnnl i f ^rdh<i*,K r*^i* 

Arabidopais t ha liana aubapecies 


NtJ A 1 & 1 


So LP 


mar ke r in 


<*Q0> 










cagtctaaaa 


gcgagagtafc gatg 






24 


<2t0> 


Ba 








<2ll> 


52 








<212> 


DNA 








<212> 


Artificial sequence 








<220> 










<221> 


POJCWardt DrLTfldr for PCft anrali f icat i nw at 3 
Arabidopsia c ha liana subcpACias 






narke r in 


<* DQ> 










gc tatggage 


ttcragggca eg 








^210> 


69 








<2ll> 










<2l2> 


DNA 








<213> 


Arc i f i c i ft 1 sequence 








<220> 












Arabidopais c ha liana subspecies 




OCT D 


marker in 


<40b> 










tgccccatcc 


tgttcttctc 






20 


<210> 


9D 








<21i> 


20 








< 2l2> 


DNA 








<213> 


Artificial sequence 








<220> 










<223> 


Forward primer for PCft aTTOiif ication of 
Arabidopaie thaliana aubapaciea 


NGA13 9 






<.40D> 


90 








agagc caeca 


g a C c cqataa 






2 0 


<210> 


9i 








<211> 


21 








<212> 


DBA 








<213> 


Artificial sequence ^ 
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<22Q> 

' 221> Ar^fS Primer , fQr PCR "PUfleatiDn of sslp ^rk^r in 

Arabidopais tna liana aufc«p*eies 

t400> 32 

ggtttcgtcc cactatccag g 

21 

<21Q> 92 
«211> 22 
<^12> DMA 

<213> Artificial sequence 



<220> 
<223> 



Forward primer for pcr amplification of NGA76 SSLP mar W in 
Ara*a,3op*is thai! an* subepecieB in 



<400> 32 

ggsg^aaatg tcaccctcca cc 



*2X0> 93 

<2X1> 20 

<2I2> DNA 

<213> Artificial eequeace 



22 



<22j> 



R*v«f« primer for s?cr arnplif ication of HGA7G sslp mrkcr in 
Arsbidapaia chaliana *ub«pec±ea 



<40Q> 93 

aggcatggga gacatttacg 

<2l0> 94 

<211> 20 

<212> £*j A 

<213> Artificial sequence 



20 



<220> 
c223> 



f! r T ar ?.! riWe,r f ° r PCR «"PlUi«»tioo of ATHSOl*! SSLP marker 
in ArabidopaiB thaliana sufcepecieB 



<400> 94 

ctccaccaat cacgcaaatg 



20 
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+9 



<210> 
c:2lL> 

c21J> 


$5 
21 
OKA 

Art i f i cia 1 eequen ce 




c220> 
<222> 


Reverse primer for PCR amplification of 
in Arablrfopsis thaliana subspecies 


ATHSC191 SSLP marker 


<400> 


95 




tgatgttgat 


ggagatggtc a 


21 


c211> 
<212> 
<213> 


96 
22 
DMA 

Artificial sequence 




■» A U 

<22*> 


Forward priroer for PCR ampiif icatioii of 
Ar abi dop& is t ha 1 i ana a obspe c ies 


NGAI2 9 SS1»P marker in 


<400> 






tcaggaggaa 


ctaaagtgag <jg 


22 


<210> 
*2ll* 
<212> 
<213> 


97 
22 
DNA 

Artificial aequance 




<3*0> 
<223> 


Revere© primer for PCR amplification of 
Arabidopsis thai iama subapecieo 


w<3Ai2 9 SSLP marker in 


*400> 


97 




cacaccgaag atggtcttga gg 


22 


<210> 
<211> 
<212> 
<213> 


98 
DNA 

Arabidopeis thai i ana ecotype Coluiribia 




<220> 
<223> 


Genomic DNA sequence of ACMSH6 




<40Q> 


97 




fctttttggtt 


get acaata aaggtaeacg gttttatgee atca tacan etatatataa $d 



JNSDOClO. <WO 9919492A2TI_> 



WO 99 '19492 



PCT/£PWD63>77 



50 



aagaaa&gaa 
tcacctttta 
tcttacccct 
gaacagcttt 
ccaaatccgc 
etgctttcac 
E-CaCcggaac 
tttgaacegc 
ggctcttttg 
ggtatctgct 
tctgagccrt 
tttacacata 
ctggaacgtt 
ctttttgggt 
aattcggctc 
Uaaatttat 

ggggctagcc 

tatacaataa 
aaaaaaagtt 
ttttggcgcc 
cactctecct 
aaacccacgg 
Srgcagcggag 
gttttgctgt 
ttccgcgfceg 
ccctgttctc 
agagrgtacta 



ag* tat a tat 
cattggccaa 
aagtttaaea 
accaetctca 
ttgtgacatc 
•acgatccatc 
cgccgtagat 
gacaccgttt 
ttttgatetg 
cggcatcgtc 
gaaattgaga 
cgaagatcgg 
ct tagagatg 
tggattcgtg 
tcggttcggt 
ttcggtttca 
gaaccgaata 
tagaacataa 
gagccctgag 
aatctttccc 
cacaattcca 
c 9?cgactac 
accacgattt 
ttcgaaatct 
tgtcctgccg 
eaatattatg 
atcttcgact 



tgtttctcca 

cctagaac^t: 
tattcctgaa 
ttcttctcca 
gccgcaggrt 
ttccccettt 
ctcagagctg 
tggagaacta 
gatgaattga 
aacgatgaag 
tttacggaga 
attgacgacg 
gtacaccata 
tatgagtcat 
adtcggatcc 
cc3atgcccg 
atcggacggt 
gageatcgtt 
CCCtttCgaa 
aaaaatgeag 
gaagggtttg 
aatgtgaagg 
gtcgatgagg 
tctggattta 
cataagtttg 
ctcttaattt 



tttatcaaac 
ga.taaacgae 
tctccatctt 
ctaectgagt 
atctcgctc t 
etgtctcttc 
tgttcgaacc 
c-gttaaccgc 
ctggttccca 
gagsaaggaa 
acgaagaatg 
attcgatcac 
tgtcattatc 
ttatccgatc 
cat taaaatt 
ottactccca 
at tgactcgc 
catCaaagcc 
tccgccattt 
ttctctcagc 
cgccagagat 
gtttccggcg 
aaggggatgc 
t tagaggaac 
**,ccggctga 
t^aaagtcga 
tctcatcttt 



aaaacaacaa 

cgcaagcaea 
cctctcattg 
ecgtaecat* 
ttccagcttc 
ggcatcgaat 
ttccggatcg 
gtcttgtgtt 
caacgcgaaa 
t cat t gaga g 

rgatttgcag 
'99fctcaatg 
aagctaacea 
gaaccgaatt 
tggctagaaa 
tcaaagagtg 
ctacgacgca 
tcaaaacatc 
cgactttgtc 
atgctgctag 
caaaggcgac 
ggacactcoa 
atccgccggt 
Cgatogagat 
agccggaaga 



gacttttttt 
catttcccaa 
gcctgattag 
atctgtttcg 
atCctcacct 
ttcgtgttaa 
ttctfcaacqg 
cgtaggtct t 
actgctcctg 
atrtcattaa 
gattgtgata 
attttcgtct 
ttaaccaatg 
gttntatata 
aaaattctcg 
attcgaaacc 
gatccaacgg 
aa,cagtcaac 
aggcgaaaat 
gtttctctct 
tttcctccaa 
Cggcgggggc 
gcttctgtac 
ccggagaagg 
gatgcttcgt 
tgttctggag 
agaagattcg 



120 

iao 

240 

3 00 

3«0 

450 

490 

$4 0 

S00 



6 60 



720 

?ao 

900 
9 50 
2020 
1060 
1140 
1200 
1260 
1320 
1360 
1440 
1SO0 
1560 
1520 
159 0 
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1- r-i 1" to t « ^ 


** to to to ^ 

Lgtact eg t c 


^g^gagacr.c 


c ga c tactgc 


attggatcgt 


cgtctacaaa 


1740 


Cut tcayycky 


«- ^ y ay «3.^y — c 


gtt^ttcege 


tgaacgafcee 


atctctatgt 


acgaaggcta 


1800 




t ^*^t a a P h t 


cgx. t> ccaaca 


ac g^caaaac 


tcaagaaaga 


aaccatgett 


1360 




to f— T^T^l B rT~^rti*.t 


gaa.c t c acrsc 


c agt.agaaga 


cataggagta 


ga&ggogatg 


1920 




« if A A a r-t^. 
aydoawa CCa 


gsga t^cyi c 


c aege gc c tc 


tegcttgaag 


cgagtcctgg 


1980 


cty y a acta L 


3«c llcc stag 


^to<X^ to m 

gagga caagg 


ttcctgtatc 


ggactctaac 


aaaaggctga 


2040 




MiJW A P* M J^JBjM pp p» 

ggacccggcx 


tgeggagaga 


agaa&gaagt 


a aacgaagga 


accaaatttg 


2100 




gecc t cccga 


atcagggatg 


ccaatagaag 


Acgtcctgat 


gatccccttt 


2160 


acgatagaaa 


gaccttacac 


ataccacctg 


atgttctcaa 


gaaaatgtct 


gcat£acaaa 


2220 


Bgcaaca uCg 


gagegtea&g 


agtgaat>aca 


eggacattgt 


gcttctcttt 


aaagcggtta 


22SO 




a tctagtgtt 


caat ccatct 


cctcaatgcg 


atttgcccac 


ttacatctgt 


2340 


tcacgtcatg 


ctctfcctcag 


gggaaeiLtt c 


acgagctgta 


Cgagctagat 


gcggaatcag 


2400 


g c, cacaagga 


gcttgactgg 


sagacgacca 


tgagcggtgc 


gggaaaatgc 


agacaggcaa 


24«0 


to to & ^«to to a 

accagt cgaa 


•fiic a act ggee 


dgcttgaatt 


attgtgCcta 


taaattttga 


caeca ecttt 


55^0 


to j-m to to to ^ 

tgtcccaggt . 


tggtaCCtct 


gaaagtggga 


tagatgaggc 


agegcaaaag 


ctabtagc&c 


25B0 


g tgag t aagg 


0^ <fc A « 4to» 4rfto 

^AAccat cat. 


actttatgga 


attegtttae 


tg c t ac t teg 


gctaggatct 


2 S 40 






a c* a t a f* t~ a 


/«t f a <*4#v a t y 

g v cagga tec 


" gagaac tea 


ggacgerec e 


1*1 Art 

37 00 


ft At h rrrh f 


P a P a -a t a to 


ci ittcac ca 


sggagc aa ca 


aa caaaac c c 


0k riPB pL 4Pa PP «pv JpJ pjp 

goacaa c at c 


t *i e 

2 7 ©O 


uy ir g \i yctva 


p^ W ^p>A3 9 p«PV 

cc^SCaayg c 


acac at a c cc 


a gc caacc c w 


Cgccagt tea 


ct g t a gc aa c 


2 9 20 




jkftpji P s- to rrt P 


t. -s.ee ty Lacg 


t ggcacc etc 


a *- 1 1 At ^ 
atwicgrcvc 


tcatggaggc 


*9 HU 


uttv pay v^v 


t g c L^as a 


c c gg a c Big t c 


acatatget c 


ccaacagaaa 


c eagcaege a 


ft 4 rt 


rr a f t" a K => *r 


CttW CClal t 




4» nf- ^ to to .'rl m s 

t gc a c tgaca 


a_ — _ fci 1 | #vto 

cact cgt eg u 


to to to to to s 

c e ct 1 1 tgaa 


J DU ^ 


agat owad ag 


v w ggac gaaL 


cgagcagct & 


gaaacatctg 


a ccaagcaaa 


agecagaggt 


J Oa u 


gctaatactg 


Caagttttct 


tggataggtc 


aaggagagtg 


ttgcagactg 


tttttgatca 


M2Q 


tttccttttc 


cgracattac 


tttoaegctg 


caaccaactc 


aa&ggceatt 


ctggtctgat 


MS0 


tatcagataa 


ccccaaggaa 


gctagttcag 


gtactaactc 


caccaacagc 


aagcgaggga 


32*0 


aacategggc 


ctgacgccgt 


ccatcttctt 


gecataaaag 


aggtttgeta 


tttacttatt 


3300 
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tatcicatca tgttcagttc atccaagtcc 
eeatc a *<jct gfcgtaaagga cttggaatta 
tgcaagaggt tcccttgaaa agacctgttt 
tcttaagtga atcccctact tcctcacaat 
tatcattcct tgttactgca tccccctctt 
tctcgtgtgo tggagaaaat gatagctgat 
tgcteaggaa Ctgcctttgg fctaeategcc 
Cctttacttt agatcaaaat ggagctaca* 
gttgactgtg ctgccttgag gttttgggtt 
gccctcggag cgttactgat gcaggtaagc 
tgacttcctg tgcatatart tgggttqcag 
tttttccagg tctctccaaa sgaagtgtta 
ccagtegtct tgctaaacag aatctaaggc 
atgctgaaat attgcagggc tatcaagaga 
g^caggtacc atttcagtag gcaagctaac 
tctcttaaac attgctaatg tagatgatgt 
Stacagctgg ctccagtacc acaagtaatg 
ataatagaat ctaacggata ctttaaaggt 
ggtctaaatg aatgtgatgt tgcccttagt 
^ggccaaagg tgtgttggct tgtttagttt 
ttcataactt acagtttcta tctacttgca 
ttctccacac caagtttaca ggggttgtct 
tgagatattt aacaatagct gtgatggegg 
ataacctcaa ccagagggca gacatagaag 
agcaagtaac agccgatttt tgttcaccca 
ctgtgttagt ccaactggta agcgacrett 
tgtagaaagc atcaataaac ggcttgatgc 
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fcgaaaaatta 
gaaaatcatt 
aagattcttt 
gaccatagtc 
tctCaatgac 
ccaagctgta 
taatggtctg 
aagtgttcaa 
gggtccatea 
aa gt??tattc 
gaactaatcc 
tatgacagta 
aaatgacact 

agcacaaaag 
tgacaatcta 
ttatgtccca 
ggggatacag 
tcttctgaat 
gctcttggag 
ttgcttttca 
gctagaagat 
cagaattgat 
tccttcaggc 
gaaaaattct 
tgtagggacc 
aaggaatcgg 
agttgaagaa 



cactcttctt 
atttgatgcn 
grcacccgaaa 
tgcaactgca 
cat tg tat at 



cat t a teat g 
atgtcaattt 
ctgegtatgg 
gcgatgatgc 
tgtatctcac 
tgaatcacca 
aaggtaaact 
ggttaattta 
3«tctaagiga 
accgctcacc 
atctaatagg 
atgctgctgg 
catggaactg 
agctaactaa 
caaattaagc 
gtacttaagc 
ggecagaega 
aagegcatat 
•aatacttcgt 
ttgtacaaat 
atctgccatc 
steaeggcaa 



taccaatctt 
Ctgttttata 
aactcaatct 
tgtcaagtaa 
gttgtgtttg 
attaagtagc 
ttcttctgaa 
atttgetttt 
atcatgtgct 
gtgtaccatg 
tctggtatgt 
SCttgtatcg 
aagtgcatac 
aatatacget 
gaatgatagg 
gtctacggcg 
agttagaasc 
tgctgttgat 
tcatctgtct 
aaaggaacte 
atggggatat 
tggtaaatct 
ttcttttttg 
acggatctcc 
atcttgataa 
cactcaaaga 
actcagaaag 



33S0 
3420 
3430 
3S4g 

366Q 
3720 
37B0 
3840 



2|00 



3960 
4020 
4030 
4140 

42*0 
43-20 
4380 
4440 
4500 

4650 
4580 
4740 
4800 
4860 
4*20 
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^ rrf^ a j « 




at etc cacaa 


acctccagac 


Ctagaaagac 


tgctcggacg 




r> t r* a^frtr^t" 




wa v w cc c, c 


tgtgttgcct 


gctctcctgg 


ggaaaaaagc 


5040 


^ l. y na, a v« Dia 






gttttccgag 


Caatgccttc 


catgagcagt 


5100 


i* Layg«v <-» an 


ft A f* A 1* h 3k r**Crrfr 




ag ac tg t t ct 


ccttcbtttg 


caatgtctgg 


5L«D 


t hot t rflt t a 




laCL ESI HgC 


attgeaggtt 


aaagcatttg 


ggcaaattgt 








u &ga & c x«g c t 


gttggctcta 


cagaaggaat 


caaatatgat 






a a 3 9 ^ ^ 


gcaaac t r ec 


tatattagta 


ggaaaaagcg 


ggctagagtt 


5540 


aLLLuuLLCL 


caac Lcgaay 


cagecataga 


Cagcgacttt 


ccaaattatc 


aggtgeccae 


5400 


* ^ »*<4» I* N MB 

C CCL C C CA 


LJLCCC C ACcLa 


Caaaat gt, ct 


gtc acta etc 


aaagcaatgc 


atatggecta 


54 60 


w s .r^4~ » ^ «■ 

yot Cu CaaC C 


c ac ac cccga 


ggaccctftaa 


gggatttgcC 


tit tattcct 


aatgtttttg 


S520 


ga t gg t c t ga 


L t tattcct a 


actcgaact £ 


attaatcttg 


taccagaacc 


aagatgtgac 




C ^ aaaa C 


gccgaaactc 


tcacaat Act 


tatcgaactc 


tttatogaaa 


g^gcaactca 


SS40 


a £>gg c c eg ag 


gt C3 1 1 caca 


cc a taagc tg 


cctagAtgtc 


ctgagatctt 


ttgeaatege , 


5*? DO 


ag C33gt etc 




gcatggccag 


gcctgttatt 


tttcccgaat 


cagaagctac 


5750 
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